80960KB Hardware Designer's 
Reference Manual 




Order Number: 270564-001 



InteT 



LITERATURE 



To order Intel literature write or call: 

Intel Literature Sales 

P.O. Box 58130 

Santa Clara, CA 95052-8130 



Toll Free Number: 

(800) 548-4725* 



Use the order blank on the facing page or call our Toll Free Number listed above to order literature. 
Remember to add your local sales tax and a 10% postage charge for U.S. and Canada customers, 20% for 
customers outside the U.S. Prices are subject to change. 

1988 HANDBOOKS 

Product line handbooks contain data sheets, application notes, article reprints and other design information. 



**PRICEIN 
ORDER NUMBER U.S. DOLLARS 



231003 

231792 
210997 
210918 



210461 



210620 



$125.00 
$20.00 
$20.00 
$23.00 



210830 


$18.00 


231658 


$22.00 


230843 


$25.00 



$18.00 



NAME 

COMPLETE SET OF 8 HANDBOOKS 

Save $50.00 off the retail price of $175.00 

AUTOMOTIVE HANDBOOK 

(Not included in handbook Set) 

COMPONENTS QUALITY/RELIABILITY HANDBOOK 

(Available in July) 

EMBEDDED CONTROLLER HANDBOOK 

(2 Volume Set) 

MEMORY COMPONENTS HANDBOOK 
MICROCOMMUNICATIONS HANDBOOK 

MICROPROCESSOR AND PERIPHERAL HANDBOOK 

(2 Volume Set) 

MILITARY HANDBOOK 

(Not included in handbook Set) 

OEM BOARDS AND SYSTEMS HANDBOOK 
PROGRAMMABLE LOGIC HANDBOOK 
SYSTEMS QUALITY/RELIABILITY HANDBOOK 

PRODUCT GUIDE 

Overview of Intel's complete product lines 

DEVELOPMENT TOOLS CATALOG 

INTEL PACKAGING OUTLINES AND DIMENSIONS 

Packaging types, number of leads, etc. 

LITERATURE PRICE LIST 

List of Intel Literature 

*Good in the U.S. and Canada 

**These prices are for the U.S. and Canada only. In Europe and other international locations, please contact 
your local Intel Sales Office or Distributor for literature prices. 



280407 


$18.00 


296083 


$18.00 


231762 


$20.00 


210846 


N/C 


280199 


N/C 


231369 


N/C 



N/C 



InteT 



NAMF- 


LITERATURE SALES ORDER FORM 


nOMPANY- 


AnnRFCi<=5- 


niTY- 


STATF- 7IP- 


nniJNTRY- 


PHONE NO ■ ( 


) 





ORDER NO. 







































































































TITLE 



QTY. PRICE 

X 

X 



TOTAL 



.x_ 



.X. 
.X. 



Subtotal . 



Must add appropriate postage to subtotal 
(10% U.S. and Canada, 20% all other) 



Must Add Your 
Local Sales Tax . 



Total . 



Pay by Visa, MasterCard, American Express, Check, Money Order, or company purchase order payable 
to Intel Literature Sales. Allow 2-4 weeks for delivery. 

D Visa n MasterCard D American Express Expiration Date 

Account No. 

Signature: . _ 



Mail To: 



Intel Literature Sales 
P.O. Box 58130 
Santa Clara, CA 
95052-8130 



International Customers outside the U.S. and Canada 
should contact their local Intel Sales Office or Distributor 
listed In the back of most Intel literature. 
European Literature Order Form in back of book. 



Call Toll Free: (800) 548-4725 for phone orders 

Prices good until 12/31/88. 
Source HB 



Mail To: Intel Literature Sales 
RO. Box 58130 
Santa Clara, CA 95052-8130 



iny 



80960KB 

HARDWARE DESIGNER'S 

REFERENCE MANUAL 



1988 



iny- 



Intel Corporation makes no warranty for the use of its products and assumes no responsibility for any errors 
which may appear in this document nor does it make a commitment to update the information contained 
herein. 

Intel retains the right to make changes to these specifications at any time, without notice. 

Contact your local sales office to obtain the latest specifications before placing your order. 

The following are trademarks of Intel Corporation and may only be used to identify Intel Products: 

Above. BITBUS, COMMputer, CREDIT, Data Pipeline, FASTPATH. 
GENIUS, i, t, ICE, iCEL, iCS, iDBP, IDIS. |2|CE. iLBX, im, IMDDX, iMMX, 
Inboard, Insite, Intel, Intel, intelBOS, Intel Certified, Intelevision, 
inteligent Identifier, inteligent Programming, Intellec, Intellink, iOSP, 
iPDS, iPSC, IRMK, IRMX, iSBC, iSBX, iSDM, iSXM, KEPROM, Library 
Manager, MAP-NET, MCS, Megachassis, MICROMAINFRAME, 
MULTIBUS, MULTICHANNEL, MULTIMODULE, MultiSERVER, ONCE, 
OpenNET. OTP, PC-BUBBLE, Plug-A-Bubble, PROMPT, Promware, 
QUEST, QueX, Quick-Pulse Programming, Ripplemode, RMX/80, RUPI, 
Seamless, SLD, SugarCube, SupportNET, UPI, and VLSICEL, and the 
combination of ICE, iCS, IRMX, iSBC, iSBX, iSXM, MCS, or UPI and a 
numerical suffix, 4-SITE. 

MDS is an ordering code only and is not used as a product name or trademark. MDS® is a registered 
trademark of Mohawk Data Sciences Corporation. 

♦MULTIBUS is a patented Intel bus. 

Additional copies of this manual or other Intel literature may be obtained from: 

Intel Corporation 
Literature Distribution 
Mail Stop SC6-59 
3065 Bowers Avenue 
Santa Clara, CA 95061 



TABLE OF CONTENTS 



CHAPTER 1 
INTRODUCTION TO THE 80960KB MICROPROCESSOR 

Architectural Attributes for Embedded Computing 1-1 

Load/Store Design 1-1 

Large General-Purpose Register Sets 1-2 

Small Number of Addressing Modes 1-3 

Simplified Instruction Format 1-3 

Overlapped Execution 1-4 

Minimum cycle operation 1-4 

Additional 80960KB Architectural Enhancements 1-4 

Floating-point Operation 1-4 

Debug Capabilities 1-5 

Standard Bus Interface 1-5 

Inter-Agent Communication/Coprocessor Capabilities 1-5 

Summary . 1-5 

CHAPTER 2 
80960KB SYSTEM ARCHITECTURE 

Overview of a Single Processor System Architecture 2-1 

80960KB Processor and the L-Bus 2-1 

Memory Module 2-2 

I/O Module , 2-3 

Summary .2-3 

CHAPTER 3 
THE 80960KB PROCESSOR AND THE LOCAL BUS 

Overview of the 80960KB L-Bus . 3-1 

Basic L-Bus States 3-1 

L-Bus Signal Groups 3-3 

Address/Data 3-3 

Control 3-4 

L-Bus Transactions . 3-8 

Clock Signal 3-8 

Basic Read 3-8 

Basic Write 3-1 1 

Burst 3-12 

Timing Generation 3-14 

80960KB Processor Clock Requirements 3-14 

Clock Generation 3-15 

Arbitration 3-17 

Single 80960KB Processor on the L-Bus 3-18 



iny 



TABLE OF CONTENTS 



State Diagram 3-18 

Arbitration Timing 3-19 

Two 80960KB Processors on the L-Bus 3-20 

Bus states for Two 80960KB Processors 3-21 

Arbitration Timing for Two 80960KB Processors on the L-Bus 3-23 

Bus Exchange Example Between Two 80960KB Processors 3-23 

A Peripheral Device As the Default Bus Master 3-24 

Inter-Agent Communication (lAC) 3-25 

Overview of lAC Operations 3-26 

lAC Messages 3-26 

Hardware Requirements for External lAC Messages 3-27 

Message Buffers 3-27 

lAC Pin Logic . . 3-27 

External Priority Register 3-28 

Hardware Requirements 3-28 

External Priority and lAC Messages 3-28 

Interrupts 3-29 

Interrupt Signals 3-29 

Interrupt Control Register 3-30 

Using the Four Direct Interrupt Pins . 3-31 

Using an External Interrupt Controller 3-31 

Using lAC Requests for Interrupts 3-32 

Synchronization 3-32 

RESET and Initialization 3-33 

RESET Timing Requirements 3-33 

RESET Timing Generation .3-33 

Initialization 3-34 

Error Signals 3-37 

Summary . 3-37 

CHAPTER 4 
MEMORY INTERFACE 

Basic Memory Interface 4-1 

Data Transceivers 4-1 

Address Latch/Demultiplexer .4-2 

Address Decoder 4-2 

Burst Logic 4-3 

Timing Control Logic 4-5 

Byte Enable Latch 4-6 

SRAM Interface .4-6 

SRAM Interface Logic 4-6 

SRAM Timing Considerations 4-7 

DRAM Controller 4-11 

Address Multiplexer . . . . 4-13 



Intel 



TABLE OF CONTENTS 



Refresh Interval Timer 4-13 

Arbiter 4-13 

DRAM Timing and Control 4-13 

Timing Considerations for the DRAM Controller 4-17 

DRAM Interleaving 4-19 

Summary 4-20 

CHAPTER 5 
I/O INTERFACE 

Interfacing to 8-bit and 16-bit Peripherals 5-1 

General System Interface 5-1 

Data Transceivers 5-3 

Address Latch/Demultiplexer 5-3 

Address Decoder 5-3 

Timing Control Logic 5-4 

I/O Interface Design Examples 5-5 

8259A Programmable Interrupt Controller 5-5 

Interface 5-5 

Operation .5-7 

82530 Serial Communication Controller Example 5-7 

82586 Local Area Network Coprocessor Example 5-9 

Interface 5-10 

Operation 5-12 

82786 Graphics Coprocessor Example 5-14 

Interface 5-1 5 

Operation 5-1 7 

Summary 5-19 



Figures 

1-1. Local Register Set 1-2 

1-2. Global Register Set 1-3 

2-1 . Basic 80960KB System Configuration 2-2 

3-1. Basic L-Bus States . 3-2 

3-2. L-Bus Signal Groups 3-3 

3-3. Byte Enable Timing Diagram 3-5 

3-4. Clock Relationships 3-8 

3-5. 80960KB Processor Read Transaction 3-10 

3-6. 80960KB Processor Write Transaction . .3-12 

3-7. 80960KB Processor Burst Read Transaction 3-13 

3-8. 80960KB Processor Burst Write Transaction .3-14 



intel 



TABLE OF CONTENTS 



3-9. System Clock Pulse 3-15 

3-10. Clock Generation Circuit 3-16 

3-1 1 . Clock Timing Waveforms 3-17 

3-12. L-Bus States with Arbitration ...... 3-19 

3-13. Arbitration Timing Diagram for a Bus Master 3-20 

3-14. Arbitration Connection Between Two 80960KB Processors 3-21 

3-15. L-Bus States for Secondary Bus Master 3-22 

3-16. Arbitration Timing Diagram for an SBM 3-23 

3-17. Example of a Bus Exchange Transaction 3-24 

3-18. Forced Relinquishment Timing Diagram for an SBM 3-25 

3-1 9. Example Flow Chart for an lAC Operation 3-26 

3-20. Data Settings ... . 3-27 

3-21 . Physical Address Interpretation for lAC Messages 3-28 

3-22. Interrupt Control Register . 3-30 

3-23. Timing Diagram for Interrupt Acknowledge Transaction 3-31 

3-24. RESET Timing Diagram 3-33 

3-25. Asynchronous RESET Circuit 3-33 

3-26. Diagram for RESET Timing Generation 3-34 

3-27. Synchronous RESET Circuit 3-34 

3-28. Initialization Flow Chart 3-35 

3-29. RESET Signal Timing Relationship 3-36 

4-1. Simplified Block Diagram for Memory Interface Logic 4-2 

4-2. Burst Logic Flow Chart . 4-4 

4-3. Timing Control Logic Block Diagram 4-5 

4-4. Block Diagram for SRAM Interface 4-7 

4-5. Critical Timing Path for SRAM Read Operation . . 4-8 

4-6. Critical Timing Path for SRAM Write Transaction 4-10 

4-7. DRAM Controller Block Diagram ... 4-12 

4-8. Flow Chart for DRAM Timing and Control Logic 4-15 

4-9. Timing Diagram for Two-word DRAM Read Transaction 4-18 

4-10. Timing Diagram for Two-word DRAM Write Transaction 4-19 

5-1 . Simplified General System Interface 5-2 

5-2. Timing Control Block Diagram 5-4 

5-3. Block Diagram for 8259A Interface 5-6 

5-4. Block Diagram for 82530 Interface 5-8 

5-5. LAN Station ....... 5-10 

5-6. Block Diagram for LAN Controller Interface 5-11 

5-7. Byte Enable Generation Circuit 5-12 

5-8. Operational Flow Diagram for 82586 Interface 5-14 

5-9. Block Diagram for 82786 Interface . 5-16 

5-10. Operational Flow Diagram for 82786 Interface Circuit 5-18 



intel^ 



TABLE OF CONTENTS 



Tables 

3-1 . SIZE Signal Decoding 3-4 

3-2. Byte Enable Signal Decoding 3-5 

3-3. Summary of L-Bus Signals 3-7 

3-4. Combination of Bus Masters 3-18 

4-1 . Byte Enable Signal Decoding 4-6 



Preface 



PREFACE 



This manual serves as the definitive hardware reference guide for system designs using the 
80960KB and 80960KA processors (for clarity, references to the 80960KB apply to both the 
80960KA and 80960KB processors). Hardware designers can use this manual as a guideline 
for developing microprocessor systems. Readers of this manual should be familiar with the 
operating principles of microprocessors and with the 80960KB data sheet. 

This manual presents the 80960KB system design from a hardware perspective. Other infor- 
mation on the software architecture, instruction set, and programming of the 80960KB proces- 
sor can be found in the 80960KB CPU Programmer' s Reference Manual. 

Together with the 80960KB Hardware Designer s Reference Manual, these publications 
provide a complete description of the 80960KB system for hardware and software designers. 

MANUAL ORGANIZATION 

This manual, divided into five chapters, describes how to build a hardware system using the 
80960KB processor. The list below shows a synopsis of each chapter. 

• Chapter 1 briefly introduces the 80960KB component architecture. 

• Chapter 2 presents an overview of the 80960KB hardware system design, which includes a 
system configuration illustrating the various components that constitute a 80960KB sys- 
tem. 

• Chapter 3 describes the local bus and the interface to the 80960KB processor. This 
chapter includes detailed signal descriptions and discusses timing generation, arbitration, 
interrupt handling, and initialization. 

• Chapter 4 discusses techniques for designing memory subsystems. 

• Chapter 5 presents guidelines on how to interface I/O devices to the local bus. 

Wherever appropriate, design examples are included in the chapters. These designs are based 
upon functional 80960KB boards and systems, and are simplified for ease of understanding. 
The simplified versions of these designs have not been tested except for the figures that show 
the part numbers. 

NOTATION CONVENTIONS 

This manual uses the following style conventions. 

• Integer numbers are presented in decimal notation unless otherwise indicated by the sub- 
script "H" for hexidecimal or "B" for binary. 



An active low signal is represented by a line over the signal name. For example, READY 
is an active low signal. 
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CHAPTER 1 
INTRODUCTION TO THE 80960KB MICROPROCESSOR 



The 80960KB is the first 32-bit microprocessor designed especially for embedded applications. 
At an operating frequency of 20 MHz, this high performance processor can sustain an instruc- 
tion execution rate of seven and one-half million instructions per second (MIPS), and burst 
rates of 20 MIPS^ The 80960KB processor enhances embedded system performance by 
integrating special features to eliminate the need for additional peripheral devices and the 
associated software overhead. For instance, the 80960KB processor offers an on-chip floating- 
point processing unit, an improved interrupt handling capability, and support for debugging 
and tracing. 

This chapter describes the architectural attributes and enhancements of the 80960KB processor 
for embedded computing. 

ARCHITECTURAL ATTRIBUTES FOR EMBEDDED COMPUTING 

For over a decade, Intel has designed a large variety of 8- and 16-bit microcontrollers to fit the 
needs of embedded applications. Based on this experience, several architectural attributes 
shared by both microcontrollers and microprocessors can be implemented that benefit em- 
bedded applications and enhance microprocessor performance. Because the 80960KB proces- 
sor incorporates these attributes (listed below) in its architecture, embedded applications are 
easy to design, perform well, and get to market fast. 

Simple load/store design 

Large general-purpose register sets 

Boolean and bit-field instructions 

Small number of operations and addressing modes 

Simplified instruction format 

Minimum cycle operation 

Load/Store Design 

In the 80960 family architecture, operations are register-to-register, with only LOAD and 
STORE instructions accessing memory. This attribute simplifies the instruction set and 
shortens cycle time. 

The 80960KB processor uses LOAD and STORE instructions to access memory. It further 
minimizes accesses to memory by providing a 512-byte, direct-mapped instruction cache. 
When a memory access is required, the processor can perform a burst transaction that accesses 
up to four data words with one word transferred every clock cycle. 



^DEC VAX 1 1/780 equals 1 MIPS 
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Large General-Purpose Register Sets 

Because the instructions operate on operands within registers, the 80960 family uses many 
registers. The 80960KB processor features large, versatile register sets. For maximum 
flexibility, each processor provides 32 32-bit registers and foui* 80-bit floating-point registers. 

There are two types of general-purpose registers: local and global. The processor automati- 
cally accesses the 16 local registers wlifen a procedure call is performed. Multiple sets of local 
registers are stored on-chip to further increase the efficiency of this register set, as shown in 
Figure 1-1. The register cache holds up to four local register frames, which means that up to 
three procedure calls can be made without having to access the procedure stack resident in 
memory. 



MULTIPLE 

REGISTER 

SETS 



ONE OF 

FOUR LOCAL 

REGISTER 

SETS 



LOCAL REGISTER SET 




Figure 1-1: Local Register Set 

The 20 global registers retain their contents across procedure boundaries. The global registers 
consist of sixteen 32-bit registers (Gj^ through Gq) and four 80-bit registers (FP3 through FPq), 
as shown in Figure 1-2. While all registers can be used for floating-point operations, the 80-bit 
registers are used for accumulation of extended precision results. 
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GLOBAL REGISTERS 



M5 



Go 



31 



FLOATING POINT REGISTERS* 



FP3* 
FPo* 



79 



Note: 



Any register can be used for floating-point operations. The 80-bit 
registers are provided for extended precision accumulation. 



Figure 1-2: Global Register Set 



Small Number of Addressing Modes 

The 80960 family uses relatively few addressing modes to facilitate a fast, simple inter- 
pretation by the control engine. The 80960KB processor provides simple, fast addressing 
modes, as well as a few complex addressing modes to allow optimizations for code density. 

Simplified Instruction Format 

A simplified instruction format eases the hardwired decoding of instructions, which again 
speeds control paths. The 80960KB processor's instruction formats are simple and word 
aligned; all instructions are one word long except for one class that uses the subsequent word 
as a 32-bit displacement. To further enhance performance, the instructions do not cross word 
boundaries. This feature eliminates a pipeline stage (that would have to align instructions) and 
decreases instruction execution time. 
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Overlapped Execution 

To optimize performance, the 80960KB processor overlaps instruction execution by means of 
write buffering and register scoreboarding. Write buffering allows a write instruction to 
proceed as soon as it is placed in the buffer. It does not have to wait for the actual write 
operation to occur on the L-bus. 

Similarly, register scoreboarding is a design technique that allows the 80960KB to continue 
execution of instructions when it encounters a LOAD instruction. When the LOAD instruction 
begins, the 80960KB sets a scoreboard bit on the target register. After the target register is 
loaded with data, the processor resets the bit. While the data is being retrieved, additional 
instructions that do not reference the target register can be executed. The 80960KB ensures 
that these additional instructions do not reference the target register by checking the scoreboard 
bit transparently (no software required). Thus, the scoreboard feature reduces the effect of 
slow memory speed and provides a useful tool for optimizing procedures. 

Minimum cycle operation 

The 80960KB processor executes most of the core instructions in a single clock cycle. For 
these instructions, the 80960KB processor uses hardwired logic rather than microcode to ex- 
ecute the instruction. 

The 80960KB also supports a number of important multicycle instructions, such as 32-bit 
multiply and divide instructions. These auxiliary functions require more than one clock cycle 
because it is more efficient to use microcode than hardwired logic. On the other hand, the 
integration of these functions on-chip eliminates much software overhead and the negative 
effects on code density that would be otherwise required. Thus, the additional functionality of 
the 80960KB enhances overall system performance while keeping code size small. 

ADDITIONAL 80960KB ARCHITECTURAL ENHANCEMENTS 

The 80960KB incorporates two useful features: an on-chip floating-point processing and 
debugging functions. The floating-point unit can be used for applications that require preci- 
sion, such as machine-control operations. The debugging function significantly decreases 
development time. 

Floating-point Operation 

The on-chip floating-point unit of each processor improves the performance of floating-point 
calculations by eliminating bus overhead used to transfer operands to a coprocessor. The 
processor provides hardware support for both mandatory and recommended portions of IEEE 
standard 754 for floating-point arithmetic, exponential, logarithmic, and other transcendental 
functions. By integrating the floating-point unit on-chip, the 80960KB processor reduces the 
overall chip count for a system, decreases power consumption, and increases overall perfor- 
mance and reliability. 



1-4 



intel 



INTRODUCTION TO THE 80960KB MICROPROCESSOR 



Debug Capabilities 

The processor provides extensive system debug capabilities, an important feature for em- 
bedded computing where the abihty to instrument an application may be limited. The 
80960KB processor allows breakpoint instructions that stop program execution on various 
events, such as procedure calls, or certain instructions. Another debug facility traces the 
activity of the processor while it is executing a program. Tracing is done by recording the 
addresses of instructions that cause trace events to occur. For example, a trace event can occur 
on the execution of a specific instruction, branch, or procedure call. To ensure that the 
80960KB is operating properly, the processor performs a self-test when it is reset. If the 
self-test is successful, the 80960KB begins operation, otherwise it enters the stopped state. 

STANDARD BUS INTERFACE 

The advanced features of the 80960KB processor are implemented using a performance- 
optimized bus interface. The processor uses a high bandwidth local bus (L-bus) that consists 
of standard signal groups: a 32-bit multiplexed address/data path and control signals for data 
transactions. Because of the large amount of caching, the L-bus supports burst transactions 
that transfer up to four successive data words. Transactions on the L-bus can use 8-, 16-, and 
32-bit data types and address up to 4G bytes of physical memory. Bus arbitration can be 
accomplished by simply using the hold request/hold acknowledge protocol. 

INTER-AGENT COMMUNICATION/COPROCESSOR CAPABILITIES 

The 80960KB processor offers a flexible way to manage interrupts. It accepts interrupts in one 
of three ways: by communicating with an external interrupt controller using the standard 
Interrupt/Interrupt Acknowledge signals, by activating the on-chip interrupt controller, or by 
accepting an inter-agent communication (I AC) message. This allows the 80960KB to act as a 
coprocessor on a shared bus with another CPU. 

SUMMARY 

The 80960KB processor optimizes embedded system performance by using a new 32-bit 
architecture. The 80960 family architecture includes a load/store design, large general purpose 
register sets, fast addressing modes, a simplified instruction format, and minimized instruction 
execution cycles. 

To further enhance system performance, the 80960KB processor provides floating-point opera- 
tion, interrupt controller capabilities, and debug functions. By intergrating these functions 
on-chip, the 80960KB reduces the power requirements and overall chip count for a system. 

As a result of the 80960 architecture, the 80960KB processor provides unprecedented perfor- 
mance. For a speed selection of 20 MHz, it can sustain an instruction execution rate of over 
seven and one-half million MIPS and burst rates of 20 MIPS, speeds comparable to that of 
super minicomputers. The high instruction execution rates are made possible through a in- 
novative design that incorporates an on-chip instruction cache with burst-transfer capability. 
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CHAPTER 2 
80960KB SYSTEM ARCHITECTURE 



This chapter illustrates the flexibility and power of the 80960KB system architecture using the 
advanced 32-bit 80960KB processor. This chapter examines system configurations from a 
general perspective to explain the design concepts. Subsequent chapters describe the details of 
the system design. 

OVERVIEW OF A SINGLE PROCESSOR SYSTEM ARCHITECTURE 

The central processing module, memory module, and I/O module form the natural boundaries 
for the hardware system architecture. The modules are connected together by the high 
bandwidth 32-bit multiplexed L-bus, which can transfer data at a maximum sustained rate of 
53M bytes per second for an 80960KB processor operating at 20 MHz. 

Figure 2-1 shows a simplified block diagram of a possible system configuration. The heart of 
this system is the 80960KB processor, which fetches program instructions, executes code, 
manipulates stored information, and interacts with I/O devices. The high bandwidth L-bus 
connects the 80960KB processor to memory and I/O modules. The 80960KB processor stores 
system data and instructions and programs in the memory module. By accessing various 
peripheral devices in the I/O module, the 80960KB processor supports communication to 
terminals, modems, printers, disks, and other I/O devices. 

80960KB Processor and the L-Bus 

The 80960KB processor performs bus operations using multiplexed address and data signals 
and provides all the ne cessar y control signals. For example, standard control signals,_such as 
Address Latch Enable (ALE), Address/Data Status (ADS), Write/Read command (W^), Data 
Transmit/Receive (DT/R), and Data Enable (DEN) are provided by the 80960KB processor. 
The 80960KB processor also generates byte enable signals that specify which bytes on the 
32-bit data lines are valid for the transfer. 

The L-bus supports burst transactions, which access up to four data words at a maximum rate 
of one word per clock cycle. The 80960KB processor uses the two low-order address lines to 
indicate how many words are to be transferred. The 80960KB processor performs burst 
transactions to load the on-chip 512-byte instruction cache to minimize memory accesses for 
instruction fetches. Burst transactions can also be used for data accesses. 

To transfer control of the bus to an external bus master, the 80960KB processor provides two 
arbitration signals: hold request (HOLD) and hold acknowledge (HLDA). After receiving 
HOLD, the processor grants control of the bus to an external bus master by asserting HLDA. 
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PROCESSOR MODULE 



MEMORY MODULE 
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Figure 2-t : Basic 80960KB System Configuration 

The 80960KB processor provides a flexible interrupt structure by using an on-chip interrupt 
controller, an external interrupt controller, or both. The type of interrupt structure is specified 
by an internal interrupt vector register. For a system with multiple processors, another method 
is available, called inter-agent communication (lAC) where a processor can interrupt another 
processor by sending an lAC message. 

Complete details of the L-bus and bus operations are discussed in Chapter 3. 

Memory Module 

A memory module can consist of the memory controller. Erasable Programmable Read Only 
Memory (EPROM), and static or dynamic Random Access Memory (RAM). The memory 
controller first conditions the L-bus signals for memory operation. It demultiplexes the address 
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and data lines, generates the chip select signals from the address, detects the start of the cycle 
for burst mode operation, and latches the byte enable signals. 

The memory controller generates the control signals for EPROM, SRAM, and DRAM. In 
particular, it provides the control signals, multiplexed row/column address, and refresh control 
for dynamic RAMs. The controller can be designed to accommodate the burst transaction of 
the 80960KB processor by using the static column mode or nibble mode features of the 
dynamic RAM. In addition to supplying the operation signals, the controller generates the 
READY signal to indicate that data can be transferred to or from the 80960KB processor. 

The 80960KB processor directly addresses up to 4G bytes of physical memory. The processor 
does not allow burst accesses to cross a 16-byte boundary to ease the design of the controller. 
Each address specifies a four-byte data word within the block. Individual data bytes can be 
accessed by using the four byte enable signals from the 80960KB processor. 

Chapter 4 provides design guidelines for the memory controller. 

I/O Module 

The I/O module consists of the I/O components and the interface circuit. I/O components can 
be used to allow the 80960KB processor to use most of its clock cycles for computational and 
system management activities. Time consuming tasks can be off-loaded to specialized slave- 
type components, such as the 8259A Programmable Interrupt Controller, or the 82530 Serial 
Communication Controller. Some tasks may require a master- type component, such as the 
82586 Local Area Network Control. 

The interface circuit performs several functions. It demultiplexes the address and data lines, 
generates the chip select signals from the address, produces the I/O read or I/O write x^ ommand 
from the processor's W/R signal, latches the byte enable signals, and generates the READY 
signal. Because these functions are the same as some of the functions of the memory con- 
troller, the same logic can be used for both interfaces. For master-type peripherals that operate 
on a 16-bit data bus, the interface circuit translates the 32-bit data bus to a 16-bit data bus. 

The 80960KB processor uses memory-mapped addresses to access I/O devices. This allows 
the CPU to use many of the same instructions to exchange information for both memory and 
peripheral devices. Thus, the powerful memory-type instructions can be used to perform 8-, 
16-, and 32-bit data transfers. 

Chapter 5 describes design guidelines for the I/O interface by examining representative design 
examples. 

SUMMARY 

The basic hardware system configuration is modular and flexible. The processor, memory, and 
I/O modules form the natural boundaries in the basic hardware system architecture. The 
high-bandwidth L-bus that supports burst transfers is used for the data path between the 
80960KB processor and other modules. 



2-3 



iny 



80960KB SYSTEM ARCHITECTURE 



This chapter presents an overview for basic hardware system design. The next three chapters 
discuss the details of the L-bus, memory modules, and I/O modules. 
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CHAPTERS 
THE 80960K6 PROCESSOR AND THE LOCAL BUS 



The 32-bit multiplexed local bus (L-bus) connects the 80960KB processor to memory and I/O 
and forms the backbone of any 80960KB processor based system. This high bandwidth bus 
provides burst-transfer capability allowing up to four successive 32-bit data word transfers at a 
maximum rate of one word every clock cycle. In addition to the L-bus signals, the 80960KB 
processor uses other signals to communicate to other bus masters. This chapter, which 
describes these signals and the associated operations, follows the outline shown below: 

L-bus states and their relationship to each other 

L-bus signal groups, which consist of address/data and control 

L-bus read, write, and burst transactions 

L-bus timing analyses and timing circuit generation 

Related L-bus operations such as arbitration, interrupt, and reset operations 

OVERVIEW OF THE 80960KB L-BUS 

The L-bus forms the data communication path between the various components in a basic 
80960KB hardware system. The 80960KB processor utilizes the L-bus to fetch instructions, to 
manipulate information from both memory and I/O devices, and to respond to interrupts. To 
perform these functions at a high data rate, the 80960KB processor provides a burst mode, 
which transfers up to four data words at a maximum rate of one 32-bit word per clock cycle. 
The 80960KB L-bus has the following features: 

• 32-bit multiplexed address/data path 

• High data bandwidth relative to the speed selection of the 80960KB processor 

• Four byte enables and a four- word burst capability that allow transfers from 1 to 16 bytes 
in length 

• Support for TTL latches and buffers. 

BASIC L-BUS STATES 

The L-bus has five basic bus states: idle (T-), address (T^), data (T^), recovery (T^.), and wait 
(T^). During system operation, the 80960KB processor continuously enters and exits different 
bus states as shown in Figure 3-1. This state diagram assumes that only one bus master resides 
on the L-bus. 

The processor occupies the T- state when no address/data transfers are in progress. When a 
new request is received, the 80960KB processor enters the T^ state to transmit the address. 
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NOT READY - 
BURST 
NO BURST - 



READY asserted 



READY not asserted 
Multiple word access in progress 
Multiple word access done . or a 
one-word access 



Figure 3-1: Basic L-Bus States 

Following a T^ state, the 80960KB processor enters a T^ state to transmit or rec eive data on the 
address/data lines provided that the data is ready (indicated by the assertion of READY at the 
input of the processor). If the data is not ready, the processor enters a T^ state and remains in 
this state until data is ready. T^ states may be repeated as many times as necessary to allow 
sufficient time for the memory or I/O device to respond. 

After a data word is transferred, the 80960KB processor exits the T^ or T^ state for a single 
word transfer or enters the T^ state again to transfer another data word for a burst transaction. 
If the next data word is not ready during the next clock cycle for a burst transaction, the 
processor enters the T state again. 
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When the 80960KB processor completes the data transfer of all the data words (one or up to 
four), it enters the recovery (Tj.) state to allow sufficient time for devices (such as memories) 
on the bus to recover. The processor returns to the T- state if no new request is pending, or 
enters the T^ state if a new request is pending. 

L-BUS SIGNAL GROUPS 

The L-bus states are used to define some of the L-bus signals. As shown in Figure 3-2, the 
signals on the L-bus consist of two basic groups: address/data, and control. 



LOCAL BUS 



\ 

X 




CONTROL (12 LINES) 



Figure 3-2: L-Bus Signal Groups 



Address/Data 



The address/data signal group consists of 32 bidirectional Hnes. These signals are multiplexed 
and serve a dual purpose depending upon the bus state. 



LAD3J-LAD2 



Local Ad(lress/Data3j through Local Address/Data2 represent the 
address signals on the L-bus during the T^ state. LAD2 is the least 
significant bit, and LAD3j is the most significant address bit. LAD3J 
through LAD2 contain a physical word address. LADj and LADq 
specify the number of data words to transfer for a burst transaction. The 
address/data signals float to a high impedance state when not activated. 

SIZE (LAD J -LADq) The SIZE signal indicates whether one, two, three, or four words are 
transferred during the current transaction. During a T^ state, LAD^ and 
LADq represent the word size signals. The encoding is shown in Table 
3-1. 
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Table 3-1 : SIZE Signal Decoding 



WORD SELECTION 


LADi 


LAbo 


1 WORD 


LOW 


LOW 


2 WORDS 


LOW 


HIGH 


3 WORDS 


HIGH 


LOW 


4 WORDS 


HIGH 


HIGH 



LAD3J-LADQ 



Local Address/I)ata3j through Local Address/DataQ represent the 
data signals on the L-bus during the T^ and T^ states. LADq is the least 
significant, and LAD3j is the most significant address bit. The 
address/data signals float to a high impedance state when not activated. 



Control 



The control signal group consists of 12 signals that permit the transfer of data. These signals 
can be used to control data buffers, address latches, and other standard interface logic. 



ALE 



ADS 



DT/R 



DEN 



W/R 



BE3-BEQ 



The Address Latch Enable is an active low si gnal th at can be used to 
latch the address from the 80960KB processor. ALE is asserted during 
the T^ state and deasserted before the beginning of the T^ state. ALE 
floats to a high impedance level when the processor is not operating on 
the bus (i.e., it is in the idle state), or is at the end of any bus access. 

Address/Data Status is an active low signal that is driven by the 
80960KB processor to indicate an address state. ADS is asserted during 
every T^ state and deasse rted during the following T^ and T^ states. For 
a burst tr ansaction, ADS is asserted again every T^ (an d T^) state where 
READY was asserted in the prior cycle. The ADS signal is an open 
drain output. 

Data Transmit/Receive indicates the direction of data flow to or from 
the L-bus. For a read operation or an interrupt acknowledgement, DT/R 
is low during the T^, T^, and T^ states to indicate_that data flows into the 
80960KB processdr. For a write operation, DT/R is high during the T^, 
T^, and T^ states to indicate that data flows from the 80960KB proces- 
sor. DT/R never changes states when DEN is asserted. The DT/R line 
is an open drain output of the 80960KB processor. 

Data Enable is an active-low signal that can be used to enable data 
transceivers. DEN is asserted during all T^ and T^ states. The DEN 
line is an open drain output of the 80960KB processor. 

The Write/Read signal instructs a memory or I/O device t^ write or 
read data on the L-bus. The 80960KB processor asserts W/R during a 
T^ state. The signal remains valid during subsequent T^ and T^ states. 
W/R is an open drain output of the 80960KB processor. 

The Byte Enable output signals of the 80960KB processor specify 
which bytes (up to four) on the 32-bit data bus are transferred during the 
transaction. Table 3-2 shows the decoding scheme. 
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Table 3-2: Byte Enable Signal Decoding 



BYTE ENABLE SIGNAL 


ADDRESS LINE SELECTION 


BEo 


LAD7.LAD0 


Bl, 


LAD,5.LAD8 


BI2 


LAD23.LAD,6 


BE3 


LAD3,.LAD24 



The byte enable signals are valid from the 80960KB processor before 
data is transferred, as shown in Figure 3-3 (assumes no wait states). The 
byte enable signals that are valid for the first data word are specified 
during the T^ state. For a four-word burst transaction, the byte enable 
signals that are valid for the second word are asserted during the first 
data state (T^q^' ^^^ ^^^ ^^^^^ ^^^^ during the second data state (T^i), 
and for the fourth word during the third data state (T^2)- The byte 
enable signals are undefined during the last data state (T^3) of the last 
word transferred. 



TdO 



Tdl 



Td2 



Td3 



Tr 



LAD31- 
LADo 



BE3-BE0 



ersQ 



DATAq 



XZIEi 



iiX DATAi 




Br$2 



DATA2 



^CZEE 



"Xi^ 



OATA3 



" X UMDEFtNED 



XHZZII 



x: 



READY 



Figure 3-3: Byte Enable Timing Diagram 

Although not shown in the diagram, the byte enable signals of each word 
are latched internally by the 8Q9 60KB pr ocessor and remain valid duri ng 
every data or wait state until READY is applied. After READY is 
applied the byte enable signals change during the next T^ state or be- 
come undefined for the last data transfer. 

The 80960KB processor asserts only adjacent byte enables. For ex- 
ample, the 80960KB processor does not perform a bus operation with 
only BEq and BE2 active. 

The Byte Enable lines are open drain outputs. 



READY signal indicates that the data on the L-bus c an be sa mpled 
(read) or removed (write) by the 80960KB processor. If READY is not 
asserted followi ng T^ st ate or in between T^ states, a T^ state is 
generated. The READY is an active-low input signal to the 80960KB 
processor. 
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LOCK 



CACHE 



Bus Lock prevents other bus masters from gaining control of the L-bus 
during a bus operation. It is activated by certain 80960KB processor 
operations and instructions. 



The 80960KB processor uses the bus LOCK signal when it performs a 
RMW memory operation . When the processor performs a RMW-Read 
operation, it asserts t he LOC K signal during the T^ state and holds 
LOCK asserted. If the LOCK signal was already asserted, the processor 
waits until this signal is deasserted be fore pe rforming the RMW-Read 
operation. The processor deasserts the LOCK signal during the T^ state 
when it performs a RMW- Write operation. 



The 80960KB processo r assert s the LOCK signal during the interrupt 
acknowledge sequence. LOCK is an input and an open drain output. 

The Cacheable signal specifies whether the data is cacheable. If the 
80960KB processor asserts CACHE during the T^ state, then the data is 
cacheable. The CACHE signal is undefined during the T^ and T^ states. 
The CACHE signal floats to a high impedance state when the L-bus is 
not acquired. 



Table 3-3 summarizes the L-bus signals. 
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Table 3-3: Summary of L-Bus Signals 



SIGNAL 
GROUP 


SIGNAL 
SYMBOL 


SIGNAL FUNCTION 


ACTIVE 
STATE 


DIRECTION 


TYPE OF 
OUTPUT 


LOCAL 

ADDRESS/ 

DATA 


ADDRESS 
(LAD31-LAD2) 


32-BIT ADDRESS 


Ta 





3-STATE 


DATA 
(LAD31-LAD0) 


32-BIT DATA 


Td.Tw 


I/O 


3-STATE 


SIZE 
(LADi-LADq) 


SPECIFIES NUMBER OF 
WORDS TO TRANSFER 


Ta 





3-STATE 


CONTROL 


ALE 


ENABLES ADDRESS 
LATCH 


Ta 





3-STATE 


ADS 


IDENTIFIES AN ADDRESS 
STATE 


Ta,Td^T,l 





OPEN DRAIN 


DT/R 


CONTROLS DIRECTION OF 
DATA FLOW 


Ta.Td.T^ 





OPEN DRAIN 


DEN 


ENABLES DATA 
TRANSCEIVER/LATCH 


Td.Tw 





OPEN DRAIN 


W/R 


READ/WRITE COMMAND 


Ta.T^T, 





OPEN DRAIN 


"BE3-BE0 


SPECIFIES WHICH DATA 
BYTES TO TRANSFER 


Ta.Td2.T,2 





OPEN DRAIN 




INDICATES DATA IS 
READY TO TRANSFER 


Td.Tw 


1 




READY 






LOCKS BUS 


ANY 


I/O 


OPEN DRAIN 


LOCK 


CACHE 


INDICATES CACHEABLE 
TRANSACTION 


Ta 





3-STATE 



Additional pins are used by the 80960KB processor to control the execution of instructions and 
to interface to other bus masters. These pins include the arbitration, interrupt, error, and reset 
signals. Each of these signal groups are explained in separate sections. 
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L-BUS TRANSACTIONS 

The 80960KB processor uses the L-bus signals to perform transactions, which are simply 
L-bus operations where data is transferred to (or from) the CPU from (or to) another com- 
ponent. During a transaction, the 80960KB processor can transfer up to four words of data for 
a single address to enhance system throughput. This is especially useful when loading cache 
memory. 

Clock Signal 

The 80960KB hardware system typically uses two clock signals, CLK2 and CLK, to 
synchronize the transitions between L-bus states. CLK2 is the clock input to the 80960KB and 
is double the specified processor frequency. CLK is the clock input signal to the peripheral 
devices, and it is the operating frequency of the 80960KB processor. Figure 3-4 shows the 
relationship between the system CLK2 and CLK. 
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Figure 3-4: Clock Relationships 



Basic Read 



The basic transaction reads or writes one data word. Figure 3-5 shows a typical timing 
diagram for a basic read transaction (for exact timings, see the 80960KB processor data sheet). 
A read transaction may be preceded and succeeded by any type of bus transaction. The 
following sequence of events explains the flow of the timing diagram. For simplicity, no wait 
states are shown. 

1. The 80960KB processor generates several signals during the T^ state. 

• It transmits the address on the address/data lines. LAD^ and LADq specify a single 
word transaction. 
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It asserts ALE. An ALE signal can be used to latch the address. 



It asserts ADS. 

It asserts BE3-BEQ to specify which bytes are used when reading the data word. 

It brings W/R low to denote a read operation. 

It brings DT/R signal low. DT/R can be used for the direction input to data 
transceivers. 

2. During the T^ state, several actions occur. 

• The 80960KB processor reads the data on the address/data lines. 



• The 80960KB processor asserts DEN. DEN can be used to enable data transceivers. 
READY is asserted by external timing logic and data is transmitted from the storage 
devices. If READY is not ass erted, the data transfer is delayed generating a T^ state. 
The T^ state is repeated, until READY is asserted. 

3. The Tj. state follows the data state. This allows the system components adequate time (one 
processor clock cycle) to remove their outputs from the bus before the 80960KB processor 
generates the next address on the address/data lines. During the T^. state W/R, DT/R, and 
DEN become inactive. 
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Figure 3-5: 80960KB Processor Read Transaction 
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Basic Write 

Figure 3-6 shows a typical timing diagram for a basic write transaction with one wait state. 
Like the read transaction, a write operation may be preceded and succeeded by any type of bus 
transaction. The following sequence of events explains the flow of the timing diagram. 

1. Similar to the read transaction, the 80960KB processor generates several signals during 
the T^ state. 

It transmits the address on the address/data lines. LADj and LADq specify a single 
word transaction. 



It asserts ALE. An ALE signal can be used to latch the address. 



It asserts ADS. 

It asserts BE3-BEQ to specify which bytes are used when writing the data word. 

It brings W/R high to denote a write operation. 

It brings DT/R signal high. DT/R can be used for the direction input to data 
transceivers. 

2. During the T^ state, several actions occur. 

• The 80960KB places the data on the address/data lines. 



The 80960KB processor asserts DEN. DEN can be used to enable data transceivers. 



READY is not asserted by external timing logic. Consequently, data is held on the 
LAD lines. 



3. During the T^ state READY is asserted and the data is written to the s torage de vice. Note 
that the W/R, DT/R, and DEN remain constant until the bus state after READY is asserted. 

4. The Tj. state follows the wait state. During the T^. state W/R, DT/R, and DEN become 



inactive. 
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Figure 3-6: 80960KB Processor Write Transaction 



Burst 



The 80960KB processor supports burst transactions that read or write up to four words at a 
maximum rate of one word every processor clock cycle. Burst transactions are always con- 
tained within a 16-byte boundary. If a transaction crosses a 16-byte boundary, the 80960KB 
processor automatically splits the transaction into two accesses. 

The byte enable signals are valid for each word to allow partial- word write operations for a 
burst write transaction. The CACHE output signal during a T^ state applies to all words of a 
burst transaction. 
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A burst read or write transaction is similar to a basic read or write operation. It differs 
primarily in the number of data words transferred: the basic transaction always transfers one 
data word, the burst transaction transfers up to four data words. For a burst transaction, the 
byte enable signals are applied during the T^ state, and subsequently during every T^ or T^ 
state before the data word is transferred. Figure 3-7 shows the timing for a three-word burst 
read transaction without wait states. Figure 3-8 shows the timing for a two-word burst write 
transaction with a wait state occurring during the transfer of the first word. Note that the byte 
enable signals remain constant until the data state after READY is asserted. 




Figure 3-7: 80960KB Processor Burst Read Transaction 
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Figure 3-8: 80960KB Processor Burst Write Transaction 



TIMING GENERATION 

In an 80960KB processor-based system, timing signals must be generated for the clock and 
reset inputs. To generate these signals, discrete logic should be utilized to minimize skew and 
maintain the rise and fall times as short as possible. This section describes a typical circuit that 
synthesizes the clock signal. The RESET timing generation is discussed in the "RESET and 
Initialization" section on page 3-33. 

80960KB Processor Clock Requirements 

In order to design a clock generator, the clock input specifications to the 80960KB processor 
are examined first. The clock (CLK2) waveform is shown in Figure 3-9. The clock pulse is 
specified by five parameters listed below: 

• The clock fall time (tf) 

• The clock low time (tj) 
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• The clock rise time (tj.) 

• The clock high time (tj^) 

• The clock period (t^ ) 

The time required to go from 90% of the difference between the high and low voltage levels to 
10% of the difference (or from low to high) is defined as the clock fall (rise) time. The clock 
low time specifies the time required for the clock to remain within 10% of the low voltage 
level. Similarly, the clock high time specifies the required time for the clock pulse to remain 
within 10% of the high voltage level. The clock period is the sum of t^ + tj + t^. + tj^. 
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Figure 3-9: System Clock Pulse 

The clock generator must have fast enough rise and fall times to comply with the requirements 
for high and low time and the overall clock period. For example, consider a clock pulse with a 
50% duty cycle at 40 MHz. The clock period is specified at minimum of 25 ns, low time at 
minimum of 8 ns, and high time at minimum of 8 ns. This implies that the sum of the rise and 
fall time must not be greater than 9 ns. Thus, the clock generator should be designed to have 
rise and fall times not greater than 4.5 ns each. 

Besides specifying a maximum clock rate, the 80960KB processor requires a minimum CLK2 
rate of 8 MHz to maintain the state of the internal dynamic cells. Due to this minimum 
frequency requirement, the 80960KB processor cannot be single-stepped by disabling the 
clock. 



Clock Generation 

Figure 3-10 shows an example of a clock generator that produces two clock pulses, one double 
the frequency of the other with the skew between the pulses in the range of 1 to 3 ns. This 
particular circuit produces a 40-MHz clock at 50% duty cycle with rise and fall times of less 
than 4 ns. The circuit design consists of four devices: an oscillator, a pulse shaping network, a 
synchronous up/down counter, and a NAND gate driver. The output of the 80-MHz hybrid 
clock oscillator connects to the pulse shaping network (two NAND gates in series), which in 
turn feeds into the clock input of the up/down counter. This counter produces a 40-MHz CLK2 
output signal and a 20-MHz CLK output signal. Because the outputs of the counter are 
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synchronous, the skew between CLK2 and CLK is typically less than 2 ns. To provide 
adequate signal margin and maintain fast rise and fall times, the two clock signals are con- 
ditioned by the NAND gate driver. The timing waveforms of the clock circuit are shown in 
Figure 3-11. 

If the opposite phase CLK is preferred, U/D pin can be connected to V^.^. 
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Figure 3-1 0: Clock Generation Circuit 



3-16 



iny 



THE 80960KB PROCESSOR AND THE LOCAL BUS 




Figure 3-1 1 : Clock Timing Waveforms 

The hybrid clock oscillator typically requires 5 ms to stabilize after power is applied. The 
80960KB processor cannot begin to execute instructions until after the clock and V^.^. have 
reached their DC and AC specifications. The RESET signal can be used to control the start of 
the CPU execution when power is applied. This is discussed in the "RESET and Initialization" 
section on page 3-33. 

ARBITRATION 

When multiple bus masters exist, an arbitration protocol is used to exchange control of the bus. 
The protocol assumes that there are two bus masters: one that controls the bus by default, and 
the other that requests control of the bus when it performs an operation, such as a DMA 
controller. More than two bus masters may exist on the L-bus, but this requires external 
arbitration logic. There should be no more than two 80960KB processors, however, on an 
L-bus. 

Assuming that there are only two bus masters, this section examines the bus arbitration, bus 
states, and timing diagrams for different combinations of bus masters, as shown in Table 3-4. 
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Table 3-4: Combination of Bus Masters 





Bus Master Combination | 




Bus Master that Controls the Bus 
by Default 


Bus Master that Requests 
Contol of the Bus 


CASE1 


80960KB PROCESSOR 


I/O DEVICE 


CASE 2 


80960KB PROCESSOR 


80960KB PROCESSOR 


CASE 3 


I/O DEVICE 


80960KB PROCESSOR 



Single 80960KB Processor on the L-Bus 

For the first case, the 80960KB processor controls the L-bus, and a master I/O peripheral, such 
as a DMA controller, requests control of the bus for operations. The 80960KB processor and 
the I/O peripheral exchange control of the bus with two signals: the hold request (HOLD) and 
hold acknowledge (HLDA) signals. 

HOLD is an input signal of the 80960KB processor, which indicates that the master I/O 
peripheral is requesting control of the L-bus. When HOLD is asserted, the 80960KB processor 
surrenders control of the bus after it completes the current bus transaction. The 80960KB 
processor acknowledges transfer of control of the L-bus to the other bus master by asserting the 
HLDA. 



State Diagram 

Figure 3-12 shows the state diagram for a L-bus with an I/O peripheral bus master. This state 
diagram consists of the hold state (Tj^) in addition to the five basic states described in the 
"Basic L-Bus State" section on page 3-1. The 80960KB processor enters the Tj^ state when it 
surrenders the control of the bus. It can enter the Tj^ state from the T^ or T^. state. When the 
80960KB processor regains control of the L-bus, it enters the T^ state if a new request is 
pending or a T- state if no new request is pending. 
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Notes: 

T, 

Ta - 

Td - 

Tr - 

Tw - 

Th - 



Idle state 
Address state 
Data state 
Recovery state 
Wait state 
Hold state 



READY 


READY asserted 


NOT READY - 


READY not asserted 


BURST 


Multiple word access in progress 


NO BURST 


Multiple word access done , or a 




one-word access 



Figure 3-1 2: L-Bus States with Arbitration 



Arbitration Timing 



Figure 3-13 shows the ar bitration timing diagram. The "T" state represents the last cycle of a 
transaction in which the READY signal was asserted or a T- state. The 80960KB processor 
receives a request to relinquish control of the bus when HOLD is asserted. After the 80960KB 
processor completes the current transaction, it responds to this request by floating the three- 
state output signals and deasserting the open drain output signals. The HLDA output signal, 
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however, remains active and is asserted as the 80960KB processor enters a Tj^ state. During 
the Tj^ state, the CPU ignores all input signals except HOLD and RESET. When the HOLD 
input signal is deasserted, the 80960KB processor exits the Tj^ state and deasserts HLDA. 
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Figure 3-13: Arbitration Timing Diagrann for a Bus Master 



Two 80960KB Processors on the L-Bus 

For the next case, two 80960KB processors reside on the L-bus. During initialization, one is 
designated as the Primary Bus Master (PBM), the other as the Secondary Bus Master (SBM). 
The exchange protocol that is used guarantees that neither device is kept off the bus in- 
definitely. The 80960KB processors use two pins for bus arbitration: the HOLD input pin, and 
the HLDA output pins. These input and output pins for the SBM are interpreted differently, 
however. 



When the SBM is initialized, the pin normally used for HOLD input signal is interpreted as the 
hold acknowledge request (HLDAR) input signal. The assertion of HLDAR indicates that the 
PBM relinquished control of the L-bus. Similarly, the HLDA output signal of the SBM is 
interpreted as the hold request (HOLDR) output signal. The SBM asserts HOLDR to request 
acquisition of the L-bus. Thus, bus arbitration between two 80960KB processors can be 
accomplished by connecting HOLD of the PBM to HOLDR of the SBM, and HLDA of the 
PBM to the HLDAR of the SBM, as shown in Figure 3- 14. 

When using the ponnection shown in Figure 3-14, a delay must be inserted between the input 
and output signals because the minimum clock-to-output delay is less than the maximum hold 
time of the input signals. The delay time must be greater than 5 ns, but less than the clock 
period minus the setup time minus the maximum clock-to-output delay (5 ns < Delay < Tp^^.-^^ 
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Figure 3-14: Arbitration Connection Between Two 80960KB Processors 



Bus states for Two 80960KB Processors 

The state diag ram for the SBM is shown in Figure 3-15. Because there are two 80960KB 
processors, the LOCK signal is included in the state diagram. The SBM requests control of the 
L-bus by asserting HOLDR and subseque ntly en ters the hold request (Tj^^.) state provided that 
the bus is not locked (locked means that LOCK is asserted by the PBM and the SBM has a 
RMW operation pending). The SBM remains in the T^^ state until it acquires control of the 
L-bus by receiving HLDAR, The SBM returns to the T- state by deasserting HOLDR provided 
that the following two conditions exist: 

• A RMW operation is pending 

• The PBM asserted LOCK while the SBM was in the Tj^^. state. 

The SBM gains control of the bus when HLDAR is asserted provided that the bus is not 
locked. After gaining control of the L-bus, the SBM performs the operations, and enters a T^ 
state if necessary. At the end of a transaction, the SBM goes to the T^. state and deasserts 
HOLDR for at least one processor clock cycle to allow another peripheral bus master to gain 
access if needed. If another request is pending, the SBM enters the T^^ state and asserts 
HOLDR provided the bus is not locked. The PBM never forces the SBM off the bus. 
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Figure 3-15: L-Bus States for Secondary Bus Master 
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Arbitration Timing for Two 80960KB Processors on the L-Bus 



Figure 3-16 shows the timing diagram for acquiring and reUnquishing the L-bus by an SBM. 
The SBM enters into the Hold Request (T^^) state and asserts the HOLDR signal. It remains in 
the Tj^j. state until HLDAR is asserted, which indicates that the SBM can utilize the L-bus 
during the next state. When th e bus is no longer required, HOLDR is deasserted during the 
state following the last READY signal. Except for HOLDR, the output signals of the SBM go 
into a high impedance state or are deasserted for the case of open-drain outputs. 




Figure 3-16: Arbitration Timing Diagram for an SBM 



Bus Exchange Example Between Two 80960KB Processors 

Figure 3-17 shows an example of bus arbitration between a PBM and an SBM using the 
arbitration signals. Each bus master performs a one- word read and a two- word write trans- 
action to demonstrate the fastest possible bus exchanges. 
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Figure 3-17: Example of a Bus Exchange Transaction 

While the PBM is performing a read transaction, the SBM requests control of the L-bus by 
asserting HOLDR and entering the Tj^^. state. It remains in this state until the PBM grants the 
request by asserting HLDA after the read transaction is completed. After granting the request, 
the PBM enters the T^ state and remains in this state until its HOLD signal is deasserted. 
When the SBM completes the read transaction, it deasserts HOLDR and gives control back to 
the PBM. 

The PBM now performs a two- word write transaction after deasserting the HLDA. The SBM 
requests control of the bus again by asserting the HOLDR signal and enters the Tj^^. state. 
When the PBM completes the two- word write transaction, it grants the request by asserting 
HLDA and enters the Tj^ state. The SBM receives the signal on the HLDAR input and 
performs a two- word write transaction. When the SBM completes the transaction, the control 
of the L-bus is transferred to the PBM, and both the PBM and the SBM enter the T- state. 

A Peripheral Device As the Default Bus Master 

Another case exists where a peripheral device controls the L-bus, and the 80960KB processor 
requests control of the bus to perform operations. This alternative is not advisable because it 
hinders system performance. The exchange protocol is identical to the one described in the 
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previous section. The 80960KB processor is an SBM and uses two pins for bus arbitration: the 
HOLDR input pin and the HLDAR output pin. The state diagram is similar to the one shown 
in Figure 3-15. The lock conditions are not used for this case, however. 

The peripheral device grants control of the L-bus bus asserting HLDAR when the SBM 
requests use of the L-bus. The peripheral device can obtain control of the L-bus again by 
deasserting HLDAR. If this occurs, the 80960KB processor surrenders control of the bus after 
it completes the current transaction, as shown in Figure 3-18. At that time, the 80960KB 
processor deasserts the HOLDR signal and places the other output signals into a high im- 
pedance state or a deasserted open drain level. The 80960KB processor may request access to 
the L-bus by asserting HOLDR again. 
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Figure 3-18: Forced Relinquishment Timing Diagram for an SBM 



INTER-AGENT COMMUNICATION (lAC) 

The lAC mechanism gives 80960KB processors the capability to send and receive messages to 
one another and to other bus agents. The lAC mechanism is essentially a non-maskable 
interrupt with pre-defined service routines. These routines are implemented in the 80960KB 
processor and are used to perform control functions such as purging the instruction cache, 
setting breakpoint registers, or stopping and starting the processor. By using lAC messages, 
external agents can remotely control the 80960KB. This allows easy integration of the 
80960KB into system environments. 

lAC messages can also be used to generate interrupts that behave exactly the same as 
hardwired interrupts. Since the interrupt vector is encoded in the lAC message, any of the 248 
possible interrupt service routines can be invoked. For further information on lAC message 
definitions see the 80960KB CPU Programmer's Reference Manual. 
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Overview of lAC Operations 

Figure 3-19 shows a typical example of an lAC operation. In this case, an external processor 
gains control of the 80960KB by using an I AC operation. The external processor performs two 
functions: it writes the message in a buffer, called the message buffer; and it asserts the lAC 
pin of the 80960KB processor. Upon receipt of the lAC signal, the 80960KB processor stops 
executing its current process and performs a four-word read of the message buffer. After 
completing the read operation, the 80960KB processor automatically performs a one-word 
write operation to a pre-defined address to acknowledge the receipt of the message. The 
80960KB processor then proceeds to perform the required action. 




NO 



READ MESSAGE BUFFER 
WITH FOUR-WORD BURST 



WRITE TO A PRE-DEFINED ADDRESS 
TO ACKNOWLEDGE RECEIPT OF MESSAGE 



PERFORM f=tEQUEST 



Figure 3-1 9: Example Flow Chart for an lAC Operation 



lAC Messages 



The I AC messages are specifically defined and behave much like machine instructions. The 
80960KB processor reserves the upper 16M bytes (FFOOOOOO^ to FFFFFFFFjj) of the 4M-byte 
address range for I AC message operations. 

There are two types of lAC messages: internal and external. Internal lAC messages allow a 
program to send a command to its own processor. An internal lAC message is sent by writing 
to address FFOOOOlO^j. Internal lAC messages cause no L-bus activity. 
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External lAC messages can be used to send a command to another processor on the L-bus or to 
a remote proce ssor. A processor sends an external lAC message by writing to a buffer area 
and causing the lAC pin of the receiving 80960KB to be asserted. 

When the lAC pin is asserted, the recipient processor reads the reserved address to fetch the 
data from its lAC message buffer. After reading the lAC message buffer, the recipient does a 
write operation to another reserved address to acknowledge receipt of the lAC message. The 
lAC pin is deasserted as a result of this write operation, and the processor is ready to receive 
another I AC. 



Hardware Requirements for External lAC Messages 

To use the external I AC feature of the 80960KB, the following items are needed: a four- word 
message buffer RAM mapped to a reserved address to store the message, logic to assert the 
I AC pin of the 80960KB, and decoding logic to deassert the I AC pin on command from the 
80960KB. 



Message Buffers 

Each 80960KB processor that receives an lAC message must have four 32-bit words of mes- 
sage buffer. This buffer can use special hardware or a reserved area in RAM. For proper 
operation of the buffer, two requirements must be met: the receiving 80960KB must be able to 
read this buffer at FFOOOOIO^ if the receiving 80960KB's Local Processor Number (LPN) is 
equal to zero (see the "RESET and Initialization" section on page 3-33 for details of the LPN), 
or at FF000030h if the LPN is equal to one; and the sending processor must be able to write 
this buffer. 

lAC Pin Logic 

When the I AC message buffer receives a message, logic asserts the I AC pin and keeps it 
asserted. After the 80960KB processor reads the lAC message, it performs a one- word write to 
address FFOOOOOO if its LPN is zero, or FF000020 if its LPN is one. This reserved address 
serves two functions: it causes external logic to deassert the I AC pin, and it maps to a register 
that contains the current processor priority. If the low order three bits of the data word have a 
value of 100b (see Figure 3-20), the external logic should deassert the I AC pin on completion 
of the write operation. 



4 3 2 10 





lllffiP 




3^ ! LI !"i'%f'f F 





1 








ml^^^^^^^^^^^^rf'* p 


p p pB 




1 











^^^^^^^^^^^^^ p p 


p p pH 




1 


1 









ACKNOWLEDGE lAC MESSAGE 
SET PRIORITY 



SET PRIORITY AND 
ACKNOWLEDGE lAC MESSAGE 



Figure 3-20: Data Settings 
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EXTERNAL PRIORITY REGISTER 

The 80960KB contains an internal register that keeps track of the current priority (a value 
between and 31) at which it is executing. This priority is used to decide whether or not to 
service interrupts ~ higher priority interrupts are serviced, others are posted for later servicing. 
In some system designs it may be desirable to have this priority visible outside of the proces- 
sor. To allow this, the 80960KB provides support for an external priority register. Whenever 
the priority of the 80960KB changes, the contents of this register are automatically updated. 

This feature may be enabled in two steps. If the Write External Priority bit is set in the PRCB 
(see the 80960KB CPU Programmer' s Reference Manual), then the external priority register is 
updated as a result of a MODPC instruction or whenever an interrupt occurs. If external lAC 
messages are enabled, then external priority is also updated whenever a result of an lAC is to 
change processor priority. 

Hardware Requirements 

The 80960KB expects to write its priority into a 5 -bit register mapped to address FFOOOOOO if 
its LPN is zero, or FF000020 if its LPN is one. To set the priority, the processor performs a 
one- word write operation in the form shown in Figure 3-20. The priority is contained in 
bit2o-bit^^, and bit3 is asserted to indicate that the priority is changed. It is necessary to use bit3 
as a qualifier to distinguish priority write operations from lAC message acknowledgments, 
which use the same reserved address. 



External Priority and lAC Messages 

The externa l prio rity register can be used to filter I AC messages. Since the processor always 
services the lAC pin (i.e., it is non-maskable), a low priority lAC message can interrupt a high 
priority task. To prevent this, a system can associate a priority with each lAC message. This 
priority can then be compared to the priority stored in the external priority register and used to 
decide whether or not to accept the I AC message. One way to associate a priority with an I AC 
message is to encode the message priority into the lAC message destination address as shown 
in Figure 3-21 . The range of reserved addresses shown in Figure 3-21 have been set aside for 
this purpose. 
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Figure 3-21 : Physical Address Interpretation for lAC Messages 
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INTERRUPTS 

The 80960KB processor responds to external events occurring at arbitrary times by means of 
an interrupt signal. Various sources, which include hardware components and special software 
instructions, generate an interrupt signal that can suspend execution of the 80960KB 
processor's current instruction stream. The hardware-generated interrupts are discussed in this 
section. For complete information on software-generated interrupts, see the 80960KB CPU 
Programmers Reference Manual. 

The 80960KB processor provides a flexible interrupt structure. The 80960KB processor can 
be interrupted using any of three methods below: 

• Receipt o f a s ignal on any or all of the four direct interrupt input signals (INTq, INTj, 
INT^, and INT3) 

• Receipt of a signal on the interrupt request (INTR) line to obtain an external interrupt 
vector 

• Receipt of an I AC message from a processor program or external source. 

The choice of the method is determined by the setting in the on-chip Interrupt Control register. 
Interrupt signals can occur during any bus state regardless of which method is implemented. 

This section provides details on the multiplexed interrupt pins, the three interrupt methods, the 
Interrupt Control register, synchronization, and interrupt latency. 

Interrupt Signals 

The interrupt sign als a r e multi plexed on four pins of the 80960KB processor: INTq/IAC, INTj, 
INT2/INTR, and INT3/INTA. The on-chip Interrupt Control register determines how these 
pins are used (see "Interrupt Control Register" section on page 3-30). 

INTq/IAC This pin multiplexes the InterruptQ and Inter- Agent Communication 

request input signals. The 80960KB processor interprets this input sig- 
nal as either INTq or lAC. The INTq signal indicates a request for 
interrupt service when it is asserted. The lAC signal denotes that a 
message is waiting when it is asserted. 

INTj The Inter rupt^ input signal indicates a request for interrupt service 

when it is asserted. 

INT2/INTR This pin multiplexes the Interrupt2 and Interrupt Request input sig- 

nals. The 80960KB processor interprets this input signal as either INT2 
or INTR. The INT2 signal indicates a request for interrupt service when 
it is asserted. The INTR signal indicates an interrupt request from an 
external interrupt controller. The 80960KB processor responds with an 
interrupt-acknowledge sequence. To ensure an interrupt, the INTR sig- 
nal must remain asserted until the first cycle of the interrupt- 
acknowledge transaction. 



INT3/INTA This pin multiplexes the Interrupt3 input signal and Interrupt 

Acknowledge output signal. The 80960KB processor uses this pin as 
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the INT3 input signal or as the INTA output signal. The Inter rupt C on- 
trol register setting selects either the combination of INTR/INTA or 
INT2/INT3. The INT3 input signal indicates a request for interrupt ser- 
vice when it is asserted. INTA acknowledges the interrupt request from 
an external interrupt controller. The INTA signal is latched by the 
80960KB processor and remains valid during the T^ state. This signal is 
open drain output. 



Interrupt Control Register 

The 80960KB processor uses a 32-bit, on-chip Interrupt Control register to define the function 
of the multiplexed interrupt pins. This 3 2-bi t Interrupt Control register allocates eight bits for 
each of the four direct interrupt signals (INTq, INTj, INT2, and INT3). The eight bits contain 
the vector number for each interrupt signal, as shown in Figure 3-22. The vector number is 
automatically read when one of the interrupt sig nals (INTq, INTj, INT2, and INT3) is activated. 
For example, when an interrupt is signaled on INTq, the 80960KB processor uses bity-bitg of 
the Interrupt Control register as the vector number. 

The 80 960KB pro cessor uses the data field corresponding t o INT ^^ to determine identification 
of the INTq/IAC input pin; a value of OOj^ signifies the lAC function. If the data field 
corresponding to INT2 has a value of 00^^, t he 809 60KB proc essor in terprets the INT2/INTR 
pin as the INTR input signal, and the INT3/INTA pin as the INTA output signal. In other 
words, this setting specifies that the 80960KB processor should use these two pin s for c om- 
munication with an external int errupt controller. If the functions of INTR and INTA are 
selected, the direct interrupt pins (INTq and INTj) can still be used. 

The on-chip Interrupt Control register may be read and written by the Synchronous Load 
(synld) and Synchronous Move (synmov) instructions at the address FF000004j^ (see the 
80960KB CPU Programmer' s Reference Manual). The value of the data fields in the Interrupt 
Control register is FFO OOOOO^ after initi alizat ion. This setting specifies that the four interrupt 
pins function as INTA, INTR, INT ^ , and lAC. 
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Figure 3-22: interrupt Control Register 
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Using the Four Direct Interrupt Pins 

The 809 60KB processor ca n be interrupted by asserting any or all of the four interrupt input 
signals (InTq,J[NTp INT2, INT3). If the signals are simultaneously asserted, the 80960KB 
assumes that INTq has the highest priority, followed by INTj, INT2, and INT3. Software 
should follow this convention when programming the Interrupt Control register. When the 
interrupt input signals are asserted, the 80960KB processor utilizes a vector number specified 
by the Interrupt Control register as an index to an entry in the interrupt table located in 
memory. For complete software information on this topic, see the 80960KB CPU 
Programmer s Reference Manual. 

Using an External Interrupt Controller 

The 80960KB processor can communicate with an exter nal int errupt controller by performing 
an interrupt acknowledge sequence using the INTR and INTA signals. Figure 3-23 shows an 
example of the timing of an interrupt acknowledge sequence using the 8259A Programmable 
Interrupt Controller. 




Figure 3-23: Timing Diagram for Interrupt Acknowledge Transaction 
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INTR is asserted by the 8259A and remains asserted until the 80960KB processor activates the 
INTA signal for the first time. When the 80960KB processor receives an interrupt request, the 
CPU co mplete s the current transaction (or comes to some interruptible point), and asserts 
INTA. INTA remains valid through the T^, T^, and T^ states. The first assertion of INTA 
triggers the 8259A to resolve priority among its interrupt requests. 

To compensate for the timing of the 8259A, the 80960KB processor automatically inserts five 
T- states before asserting the INTA again to read the interrupt vector. Figure 3-23 shows 
READY asserted without a wait state during the first Interrupt Acknowledgement cycle and 
with one wait state during the second Interrupt Acknowledgement cycle. In practice, the 
8259A would require about four wait states in both cycles. The address during the T^ state for 
both interrupt acknowledge cycles is FFFFFFFCj^. For more details, see the "8259 A 
Programmable Interrupt Controller" section in Chapter 5 on page 5-5. 

The 80960KB processor services the interrupt according to its priority. If the interrupt has 
higher priority than the current activity, the 80960KB processor services it immediately. 
Otherwise, after reading the interrupt vector, the 80960KB processor posts the interrupt vector 
in the interrupt table. Typically, the 80960KB processor responds within 4 \xs for an interrupt 
with higher priority than the current process (assuming CLK2 at 40 MHz). If the interrupt has 
lower priority than the current activity, the interrupt is serviced when its priority is higher than 
the priority of the subsequent activity of the 80960KB processor. 

Using lAC Requests for Interrupts 

The 80960KB processor can also be interrupted by an lAC message. The 80960KB processor 
can send lAC messages to itself by using one of the Synchronous Move instructions. Because 
this message does not utilize the L-bus when sent to the same processor, no special hardware is 
required. More details are provided on page 3-25 and in the 80960KB CPU Programmer's 
Reference Manual. 

Synchronization 

The INTq/IAC, INTj, INT^/INTR, and INT3 input signals can be either synchronous or 
asynchronous to the system clock (CLK2). Synchronous interrupt signals must be set up 3 ns 
prior to the rising edge of CLK2 and held for 10 ns aft er the risin g edge of CLK2. To properly 
preset the interrupt signals for synchronous operation, INTq/IAC, INTj, INT2/INTR, and INT3 
must be deasserted for at least one processor clock cycle and asserted for at least one processor 
clock cycle. These signals may be deasserted and asserted individually. 

If the interrupt signals are asynchronous to CLK2, the 80960KB processor internally 
synchronizes them. For the CPU to recognize the asynchronous interrupt input signals, they 
must be preset by deasserting them for at least two processor clock cycles, and then asserting 
them for at least two processor clock cycles. These signals may be deasserted and asserted 
individually. 
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RESET AND INITIALIZATION 

The system RESET signal provides an orderly way to start or restart the 80960KB processor. 
When the 80960KB processor detects the low-to-high transition of RESET, it terminates all 
external activities and places the output pins in the high impedance state or deasserted con- 
dition. When the RESET signal falls low again, the 80960KB processor begins the initializa- 
tion process and later starts fetching instructions from a specific address. 

RESET Timing Requirements 

To properly reset the 80960KB processor to a known state, the low-to-high transition of 
RESET must be asserted relative to any rising edge of CLK2 and remain asserted for at least 
41 CLK2 cycles, as shown in Figure 3-24. RESET must be deasserted after the rising edge of 
CLK2, but prior to the next rising edge of CLK2. This establishes the next rising edge of 
CLK2 as edge A. 
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Figure 3-24: RESET Timing Diagram 



RESET Timing Generation 

The RESET input signal to the 80960KB processor can easily be generated by implementing a 
synchronization circuit comprised of a two D-type flip-flops, as shown in Figure 3-25. 
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Figure 3-25: Asynchronous RESET Circuit 
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The user RESET signal is synchronized with the CLK signal by applying CLK to the clock 
input of both flip-flops. To protect against a metastable RESET signal, the output of the first 
flip-flop, SYNC, is applied to the input of the second flip-flop. The output of the second 
flip-flop resul ts in a processor RESET signal. The timing diagram for these sign als is shown in 
Figure 3-26. CLK or CLK2 can be used instead of CLK in Figure 3-26. Using CLK provides 
an edge A corresponding to the rising edge of CLK. 
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Figure 3-26: Diagram for RESET Timing Generation 



This circuit assumes an asynchronous user RESET signal. If the user RESET signal is already 
synchronous with the CLK signal, the same circuitry can be implemented as shown in Figure 
3-27. In this case, however, the output from the first flip-flop is used to generate the processor 
RESET signal rather than being routed to the input of the second flip-flop. 
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Figure 3-27: Synchronous RESET Circuit 



Initialization 



The initiaUzation sequence of events is shown in Figure 3-28. When RESET is deasserted 
afte r a minimu m of 41 CLK2 cycles, several actions take place: two input pins are sampled, 
the FAILURE output signal (see next section for the pin description) is asserted, and the 
self-test is performed. 
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Figure 3-28: Initialization Flow Chart 
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When RE SET is deasserted, the 80960KB processor samples the signals residing on the 
INTq/IAC and the BAD AC input pins (see the next section for the pin description of BAD AC). 
At this time, these pins are interpreted as the Local Processor Number (LPN) and Startup 
(STARTUP) signals, respectively. The LPN input signal defines whether the 80960KB 
processor is a PBM (high voltage input level) or a SBM (low^ voltage input level). The 
STARTUP input pin indicates whether the 80960KB processor performs initialization (high 
voltage level) or not (low voltage level). The STARTUP signal is used to allow one or more 
processors to perform the active initialization. The input voltage levels for the LPN and 
STARTUP must be setup 3 ns before the rising CLK2 edge prior to edge A and held 10 ns 
beyond edge C, as shown in Figure 3-29. 



CLOCK EDGES 



CLK2 



CLK 



CPU 
RESET . 



41 CLOCK 
-CYCLES - 



OUTPUTS 




INTo/IACandBADAC 
must be set prior to 
this clock edge. 



INTo/IACandBADAC 
must be held beyond 
this clock edge. 



Latched interrupt 
signals cleared prior 
to first instruction. 



Figure 3-29: RESET Signal Timing Relationship 



Besides sampUng the two input pins, the 8Q96QKB processor asserts the FAILURE output 
signal a few cycles after RESET is deasserted. The FAILURE signal remains asserted while 
the CPU performs the self-test. If a failure is detected during the self-test, FAILURE remains 
asserted and the CPU enters the stopped state where the p rocessor does nothing. If the self- test 
completes successfully, the CPU deasserts the FAILURE signal. 



An 80960KB processor that is designated as the initialization processor proceeds by doing a 
checksum test of eight words fetched from memory at physical address 0000 OOOOj^ to ensure 
that the me mory and L-bus are operating properly. If the initial checksum is incorrect, then the 
FAILURE signal is asserted (and remains asserted) and the 80960KB processor enters the 
stopped state. After a successful checksum test, the 80960KB processor uses some of the 
words as addresses to initial data structures. Complete details are provided in the 80960KB 
CPU Programmer' s Reference Manual. 

Just prior to executing the first instruction, the 80960KB processor clears any latched interrupt 
signals. 
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ERROR SIGNALS 



The 80960KB processor provides an input signal (BADAC) for notification of an error in the 
system, and provides an output signal (FAILURE) for notification of an error within the 
processor. 



BADAC 



FAILURE 



When asserted, the Bad Access input signal indicates that an un- 
recoverable error occurred during the current data transfer. If, however, 
BADAC was asserted after a Synchronous Move or Synchronous Load 
inst ruction, t he error is recoverable. The 80960KB processor samples 
the BADAC input signal during the cycle following the one when the 
last READY is asserted. 



The FAILURE signal indicates that an error occurred d uring initializa- 
tion. The 80960KB processor always asserts FAILURE after the activa- 
tion of the RESET signal. If a failure is detected during a self-test, 
FAILURE remains asserted. Otherwise, the processor deasserts 
FAILURE after a successful self-test is performed. If the initial memory 
checksum is incorrect, the initialization processor asserts FAILURE a 
second time, and keeps it asserted. FAILURE is an open drain output 
signal. 



SUMMARY 

The L-bus is a high speed 32-bit multiplexed bus with burst-transfer capability and is designed 
to operate with the high performance 80960KB processor. The L-bus consists of two signal 
groups: address/data, and control. These signal groups are utilized by the 80960KB processor 
to perform read, write, and burst transactions. 

The arbitration, interrupt, and reset operations are related to the L-bus transactions. The 
arbitration operation transfers control of the L-bus to another bus master. Three methods are 
available to handle interrupts: by invoking the on-chip interrupt controller, by employing an 
external interrupt controller using the INTR/INTA signals, by using an lAC message. The 
reset function sets the 80960KB processor to a known internal state after it successfully 
completes the self-test. These operations offer power and flexibility to hardware system design 
using the 80960KB processor. 

This chapter focuses on the L-bus and how it relates to the 80960KB processor. The next two 
chapters develop guidelines on designing memory and peripheral devices in the hardware 
system. 



3-37 



Memory Interface 



CHAPTER 4 
MEMORY INTERFACE 



The high-speed bus interface has many features that enhance high-performance designs. In 
particular, the burst-transfer feature allows up to four successive 32-bit data word transfers at a 
maximum rate of one word every processor clock cycle. This chapter outlines approaches for 
memory designs that use these features, describes memory design considerations, analyzes the 
timing, and lists a number of useful examples. The concepts illustrated by these examples 
apply to a wide variety of memory system implementations. 

BASIC MEMORY INTERFACE 

Figure 4- 1 shows the major logic blocks of the memory interface circuit. The data transceivers 
buffer the data to compensate for any slow devices that may be connected to the 80960KB 
processor. The address latches demultiplex the address/data signals from the 80960KB proces- 
sor and latch the address. The address decoder selects the appropriate memory device from the 
latched address. To accommodate a memory burst transaction, the burst logic decrements the 
word count, increments the local address lines 3 and 2 (LAD3 an d LAD^) , and generates a 
CYCLE-IN-PROGRESS signal. The timing control generates a READY signal and other 
specific signals required by a particular memory device. The byte enable latch stores the byte 
enable signals. 

Although not part of the basic memory interface, the DRAM controller, SRAM interface, 
DRAM, SRAM, and EPROM are included in Figure 4-1 for completeness. In a hardware 
system the DRAM, SRAM, and EPROM are typically located in separate subsystems. 

Although the memory interface circuit can be designed using programmable logic, gate arrays, 
or other custom logic, the examples use standard components wherever possible to illustrate 
the design concepts. 

Data Transceivers 

Standard 8-bit transceivers can be used to provide isolation and additional drive capability for 
the L-bus. Transceivers can be used to prevent bus contention that can occur if some memories 
are slow to remove data from the L-bus after a read operation. For example, if a write 
operation follows a read operation, the 80960KB processor may drive the L-bus before a slow 
device has removed its output data, potentially causing a current spike on the power and 
ground lines. Transceivers, however, can be omitted if the data float time of the device is short 
enough and the load does not exceed the 80960KB device specifications. 

The data transceivers_can be controlled by tw o signals from the 80960KB processor: data 
transmit/receive (DT/R) and data enable (DEN). DT/R indicates the direction of data flow and 
DEN enables the transceivers. 
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Figure 4-1 : Simplified Block Diagram for Memory Interface Logic 



Address Latch/Demultiplexer 

Conventional transparent latches can be used to demultiplex the address/data lines of the 
80960KB processo r and to hold the address constant during t he me mory operation. The latch 
is controlled by the ALE signal from the 80960KB processor. ALE passes through an inverter, 
so that when ALE goes low, the address flows through the latch. The low-to-high transition of 
ALE can be used to latch the address. The output enable of the latch can be tied to ground. 
The lower four address lines (LAD3-LADQ) are latched by the burst logic. 

Address Decoder 

The 80960KB processor accesses both memory and I/O devices by supplying a 32-bit address 
and a read/write command. The address decoder determines which particular memory or I/O 
device is selected by decoding the address lines. The following discussion focuses on memory 
selection, and the "Address Decoder" section in Chapter 5 on page 5-3 discusses I/O device 
selection using memory -mapped I/O techniques. 
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The memory address can be divided into regions where one region can apply to EPROM or 
ROM, another to RAM, and another to the I/O registers. In a 80960KB-based system the 
ROM address space is likely to start at address 0000 OOOOj^ because the CPU begins execution 
at this address. The RAM or I/O regions can start at any other address in the 4G-word address 
range except for addresses FFOOOOOO^ through FFFFFFFFj^, which the 80960KB processor 
reserves for inter-agent communication. 

Because of the large address range of the 80960KB processor, the address can be divided into 
word address bits and chip select bits. Typically the higher-order address bits are decoded to 
generate the selection signal for ROM, RAM, or I/O devices. 

The address decoder can be located either before or after the address latches. Usually, it is 
placed after the latches, so that the chip-select signal does not need to be latched. Figure 4-1 
shows the block diagram of the address decoder placed behind the address latches. 

Burst Logic 

To enhance system performance, the 80960KB processor performs burst transactions that 
transfer up to four data words at a maximum rate of one word every clock cycle. A DRAM 
controller can be designed that takes advantage of the burst-transfer capability by using the 
static column mode or nibble mode features of the DRAM (see the "DRAM Controller" section 
on page 4-5). This DRAM controller requires a signal, called CYCLE-IN-PROGRESS, to 
identify the start and end of a memory cycle. The burst logic generates the CYCLE-IN- 
PROGRESS signal. 



Figure 4-2 shows the flow chart for the burst logic. If ADS is low and DEN is high, then the 
burst logic latches LAD3 through LADq, and asserts the CYCLE-IN-PROGRESS signal. The 
burst logic checks the SIZE signals (LADj and LADq). If the value of the SIZE signals equal 
zero, then the burst logic runs one memory cycle, and terminates the CYCLE-IN-PROGRESS 
signal. If the value of the SIZE signals do not equal zero, the burst logic runs one memory 
cycle, increments the address (LAD3 and LAD2), and decrements the value of the SIZE 
signals. When this is finished, the burst logic checks the value of the SIZE signals again. 
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Figure 4-2: Burst Logic Flow Cliart 
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The burst logic can be used with memory devices, such as EPROM, SRAM, or DRAM without 
static column mode or nibble mode, which do not inherently support a burst transaction. 
Because the 80960KB processor ensures that a burst transaction cannot exceed four words or 
cross a 16-byte boundary, incrementing LAD3 and LAD2 after a single data word transfer 
makes the burst transfer transparent to these devices. 

Timing Control Logic 

The timing control logic accommoda tes memo ry devices that cannot transfer information at the 
maximum bus rate by generating a READY signal when the data is available. The timing 
control logic consists of a counter and timing logic, as shown in Figure 4-3. The counter 
produces a 4-bit binary count. Th e count b egins when the CYCLE-IN-PROGRESS signal is 
asserted. Th e timing logic assert s READY at the appropriate t ime based upon the count, the 
EPROM-CS, an d the SR AM-CS signals. For a burst transfer, READY resets the counter to 
properly time a READY signal for the next data transfer. When CYCLE-IN-PROGRESS is 
deasserted, the clock counting is terminated. 
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Figure 4-3: Timing Control Logic Block Diagram 

Because the timing of DRAM is more complicated, the DRAM controller generates a 
DRAM-R DY signal to the 80960KB processor. In addit ion, the cloc k c ount, the W/ ^ com- 
mand, and SRAM-CS signal can also be used to generate SRAM-WE and SRAM-OE signals. 
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Byte Enable Latch 

The byte enable latch holds the byte enable signals constant until the DRAM controller or 
SRAM interface uses the signals. As mentioned in the "L-Bus Signal Groups" section in 
Chapter 3 on page 3-4, the byte enable signals specify which bytes (up to four) on the 32-bit 
data bus are transferred during the data cycle. Each individual byte enable signal selects eight 
data lines as shown in Table 4-1. 

Table 4-1 : Byte Enable Signal Decoding 



BYTE ENABLE SIGNAL 


ADDRESS LINE SELECTION 


BEo 


LAD7.LAD0 


Te, 


LADis.LADa 


BE2 


LAD23.LAD,6 


g-Ea 


LAD3,.LAD24 



The byte enable signals are valid from the 80960KB processor before data is transferred. 
These signals are asserted during the address cycle for the first data word transfer; they are 
asserted again during the first data cycle for the second word transfer; the second data cycle for 
the third word transfer; and the third data cycle for the fourth word transfer. For each wo rd, 
the byte enable signals remain valid throughout every data or wait cycle until READY is 
asserted. After READY is asserted, the byte enable signals change during the next processor 
clock cycle. 



The ALE signal can be used to latch the first byte enable signals. READY can be used to latch 
the other byte enable signals for each word. 



SRAM INTERFACE 

The basic memory interface can be used in conjunction with the SRAM interface to read and 
write to SRAM. This section describes the SRAM interface and examines the timing. 



SRAM Interface Logic 



The SRAM interface logic uses the latched byte enable signals, the SRAM- WE, and th e 
SRAM-OE signals to generate four output enable sig nals (SRAM -OE,^ through SRAM-OEq) 
and four write enable signals (SRAM-WE3 through SRAM-WEq), as shown in Figure 4-4. 
These signals allow the 80960KB processor to write to the data byte that is specified by the 
byte enable signals. SRAMs with separate OE and CS signals require only one OE signal per 
bank since the 80960KB ignores unrequested bytes in read operations. 
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Figure 4-4: Block Diagram for SRAM Interface 



SRAM Timing Considerations 

This section analyzes the critical timing paths of the SRAM control signals. From the critical 
path, the timing equations can be derived to determine the memory access time for no wait 
state operation. 

When evaluating critical timing paths, the timing calculations should use worst-case parameter 
specifications, rather than typical specifications. By using worst-case timing values, reliable 
operation is assured over all variations in temperature, voltage, and individual device charac- 
teristics. These timing values are determined by assuming the maximum propagation delay to 
latch an address, select a memory device, and pass through data buffers and transceivers. 
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Figure 4-5 shows the critical timing path for a one- word SRAM read operation. The diagram 
consists of three time periods: the address setup period (T^^^j-^gj), the memory response period 
(Tj^g^), and the data return period (T^^^^g^^). Note that the timing for the read command and 
ou^ut control signals does not enter into the critical timing path. 




Taddrset 



^ — ' mem 



Tdataset 



SRAM READ CYCLE 



Figure 4-5: Critical Timing Path for SRAM Read Operation 

During the T^^^^.^^^ period, th e 809 6QKB processor outputs a valid address tha t is latched on the 
low-to-high transition of the ALE signal. The address decoder generates the SRAM-CS signal 
from the latched address. During the T^^^^^^ period the SRAM responds to the commands and 
signals and retrieves the data. The access time of the memory determines the duration of the 

Tmem period. T 

signal. 



can be varied in increments of clock cycles by delaying the READY 



The data must be available at the address/data pins of the CPU before the end of the data state. 
The T^^^^g^^ period must take into account the setup time requirement of the 80960KB proces- 
sor and the throughput delay of a data transceiver. 

For a no wait state operation, the data transfer word must be completed in two system clock 
(CLR) cycles. The minimum time period for a no wait state operation (T^g^j^.j^^.^^j^) can be 
determined by using equation 1. 



T = 2CT K T T 

mem-no-wait ^^^^ " a ^ddrset ~ dataset 



(1) 



where: T 



mem-no-wait 
2CLK 



: Memory access time for no wait state operation 
: Two system clock (CLK) cycles 



addrset 



: Maximum delay to valid address 
+ Maximum throughput delay of address latch 
+ Maximum delay to generate chip select 



^dataset ~ Maximum delay through data transceiver 

+ Maximum data setup time of CPU 
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A similar analysis can be done for burst transactions. Equation 1 can be used to determine the 
access time for no wait state operation of the first word. For subsequent words, equation 2 can 
be used. In this equation, the address setup time is replaced by delay in the burst logic to 
change the address (T^^ui-sj). In this case, the data transfer of each subsequent word must be 
completed in one system clock (CLK) cycle (no address state). The minimum access time for a 
no wait state operation (Tj^em-no-wait^ ^^^ ^^ determined by using the lesser value of equation 1 
or equation 2. 



^ mem-no-wait ^I^^ " I burst ~ ^ dataset ^^^ 

where: Tj^em-no-wait ~ Memory access time for no wait state operation 

CLK = One system clock (CLK) cycles 

T^^^^^ = Maximum delay to change the address 

^dataset ~ Maximum delay through data transceiver 

+ Maximum data setup time of CPU 



The memory access time can be extended by delaying the READY signal and adding wait 
states. 

The timing analysis described for a SRAM read operation can be used for EPROM timings. If 
EPROMs are only used to store initialization programs, they are seldom accessed compared to 
memory devices used to store program data or instructions. Consequently, the addition of wait 
states during the read cycle does not affect overall system performance. 

Figure 4-6 shows the critical timing path for an SRAM write operation. The diagram consists 
of two time periods: the address setup period (T^^drset^ ^^^ ^^^ memory response period 

(T ). 

^ mem^ 
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Taddrset 



SRAM WRITE CYCLE 



Figure 4-6: Critical Timing Patii for SRAM VVrite Transaction 

During the T^^^^.^^^ period, the 8Q960KB processor outputs a valid address that is latched on the 
low-to-high transition of ALE. The address decoder generates the SRAM-GS signal from the 
latched address. 

During the T^^^^^ period the SRAM responds to the commands and writes the data. The access 
time of the memory determines the dura tion of t he T^^.^ period. T^^^^^ can be varied in 
increments of clock cycles by delaying the READY signal. 

Two timing paths should be considered during the T^^^^^ period: the path where data is supplied 
to the memory, and the path that monitors the memory write cycle time. The first path takes 
into account the time for the 80960KB processor to generate valid data, the throughput delay of 
a data transceiver, and the data setup time requirement of the memory. The second path is the 
memory write cycle specification. The longer of the two paths is the critical timing path. 

By examining the timing path required to operate the SRAM, equation 2 can be derived which 
determines SRAM write cycle time for no wait state operation. The memory cycle time is 
determined by the lesser value of equation 1 or equation 2. 
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^ mem-no-wait ^^^^ " ^ addrset ^^^ 

where: Tj^em-no-wait - Maximum delay to valid data 

+ Maximum throughput delay of data transceiver 
+ Maximum data setup time of memory 

2CLK = Two system clock (CLK) cycles 

^addrset ~ Maximum delay to valid address 



4- Maximum throughput delay of address latch 
+ Maximum delay to generate chip select 



The memory access time can be extended by delaying the READY signal and generating wait 
states. 



DRAM CONTROLLER 

This sectio n pro vides design guidelines for a DRAM controller. DRAMs offer static column 
mode and CAS before RAS refresh features. This section shows guidelines on how to use 
these features with the burst capability of the 80960KB processor to significantly enhance 
system throughput. 

The DRAM controller multiplexes the address into a row and column address, performs the 
refresh operation, arbitrates between a refresh request and memory request, and generates the 
necessary control signals for the DRAM. To implement these functions, the memory controller 
uses an address multiplexer, arbiter, refresh interval timer, and DRAM timing and control as 
shown is Figure 4-7. 

A standard DRAM controller can be used, but it typically degrades system performance. 
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Figure 4-7: DRAM Controller Block Diagram 
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Address Multiplexer 

The address multiplexer divides the DRAM address into a row and column address. The 
proper select ion of a row or column address is accomplished by the row/column select signal 
(ROW/COL) from the DRAM timing and control circuit. 

Refresh Interval Timer 



The refresh interval timer periodically generates a refresh request (REF-REQ) by counting 
enough bus cycles to equal the refresh interval period. Since a refresh request is processed 
after a completed operation, the refresh period must take into account the time required to 
perform a bus operation, as well as the DRAM refresh specification. For example, a IM-bit 
DRAM that requires 512 refresh cycles within 8 ms needs a refresh cycle every 15.6 |lis. To 
meet the DRAM specification, the refresh interval timer must generate a refresh request in less 
than 15.6 |is to compensate for any required time to complete the operation with wait states. 



After the REF-REQ signal is generated, the arbiter sends a refresh acknowledge signal 
REF-ACK back to the interval timer to assure that refresh occurred before generating another 
REF-REQ. 

Arbiter 

DRAM controller uses an arbiter to decide whether a memory cycle or refresh cycle is per- 
formed. In a synchronous design, arbitration is easily performed because memory and refresh 
cycle requests never occur at or near the same time. 

The arbiter monitors memory cycle requests a nd refresh re quests. The arbiter detects a DRAM 
memory re quest by decoding two signals: DRAM-CS and CYCLE-IN-PROGRESS. The 
REF-REQ signal indicates that a refresh cycle must be performed. The arbi ter ar bitrates 
between a memory cycle or refresh cycle and generate s a M emory/Refresh (MEM/REF) signal. 
The DRAM timing and control block uses the MEM/REF signal to start the generation of the 
control signals. 



When a refresh cycle is performed, the arbiter sends a REF-ACK signal to the refresh timer, 
which uses this signal to begin another count. 

DRAM Timing and Control 

The DRAM timing and control circuit is the final logic block and core of the DRAM con- 
troller. The functions of this circuit include the following: 



Generating the DRAM control signals (RAS, CAS, and WE) with the proper timing 
relationships during system operation 



Generating the DRAM-RDY signal 

Performing the refresh function by asserting CAS before RAS 
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• Performing several warm-up cycles required by the DRAM when power is first applied. 

The DRAM timing and control logic can be designed to take advantage of the burst-transfer 
capability of the 80960KB processor by implementing static column mode or nibble mode. 
With nibble mode, a multiplexed address is applied to the D RAM, and up to four bits of data 
are quickly transferred by successively toggling the CAS pulse. The DRAM timing and 
control logic can be designed to provide the successive CAS pulses by using the CYCLE-IN- 
PROGRESS and DRAM-RDY signals. Static column mode can also be used to take 

advantage of the burst capability of the DRAM. Static column mode allows fast access to the 
bits located in the selected row of the DRAM simply by changing the column address after the 
first access. 

Figure 4-8 shows a flow chart for the DRAM timing and control logic using static column 
mode. The DRAM timing and control circuit receives a refresh request or a memory request 
on the MEM^REF and CYCLE-IN-PROGRESS input signals. For a memory request, the 
DRj\M timing and control determines whether a read or a write operation is desired from the 
W^ signal from the 80960KB processor. 
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Figure 4-8: Flow Chart for DRAM Timing and Control Logic 

For a read operation, the DRAM timing and control logic performs seve ral fu nctions: it brings 
ROW/COL high to select a row a ddress; it assert s RASq; it brings ROW/COL low to select the 
column address; it asserts CAS^ through CAS^^ (deriv ed from the f our latched byte enable 
signals); and it generates a DRAM-RDY signal. The DRAM-RDY signal causes the burst 
logic to increment the address and the 80960KB processor to read the data word. 
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After completing these functions the DRAM timing and control logic samples the CYCLE-IN- 
PROGRESS to determine whether t o tran sfer another data word. If so, the DRAM timing and 
control logic maintains the ROW/COL signal low to select the new column address and 
generates another DRAM-RDY. The DRAM timing and control logic repeats the procedure 
until all the data words are transferred. Then the DRAM timing and control logic deasserts 
RASq and observes the necessary row precharge specification of the DRAM. 

For a write operatio n, th e DRAM timing and control logic performs similar functions on the 
first word: it assert s WE; it brings ROW/COL high to select a r ow ad dress; it asserts RASq; it 
brings ROW/COL low to select the column address; it asserts C AS^ through C AS^^ (derived 
from the fou r latched byte enable signals); and it generates a DRAM-RDY signal. The 
DRAM-RDY signal causes the burst logic to increment the address and informs the 80960KB 
processor that the data word was written. 

After completing these functions the DRAM timing and control logic samples the CYCLE-IN- 
PROGRESS to determine whether to transfer another data word. If so, the DRAM timing and 
control logi c mai ntains the ROW /COL signal lo w to s elect the new column address, deasserts 
and asserts CAS3 thro ugh CAS^^ to observe the CAS precharge specification of the DRAM, 
and generates another DRAM-RDY. The DRAM timing and control logic repeats the proce- 
dure unti l all the data words are transferred. Then the DRAM timing and control logic 
deasserts RASq and observes the necessary row precharge specification of the DRAM. 



Although only one RAS signal is genera ted, f our CAS signals (CAS3-CASQ) are generated to 
enable each byte of the L-bus. These CAS signals are triggered by the four write signals 
generated by the byte enable decoder and correspond to the byte enable signals of the 
80960KB processor. For example, CASq, which is mapped directly from BEq, selects the 
least-significant data byte (LAD^-LADq). 



A single WE control signal and four CAS signals ensure that only those DRAM bytes selected 
for a write cycle are enabled. All other data bytes maintain their outputs in the hig h-imp edance 
state. A common des ign error is to use a single CAS control signal and four WE control 
signals, using the WE signals to write the DRAM bytes selectively in write cycles that use 
fewer than 32 bits. Although the selected bytes are written correctly, the unselected bytes are 
enabled for a read cycle. These bytes output their data to the unselected bytes of the data bus 
while the data transceivers output data to every bit of the data bus. When the two devices 
simultaneously output data to the same bus, bus contention occurs. 



The refres h function can be performed simply by asserting CAS signal before asserting RAS. 
The CAS before RAS r efresh feature eliminates the need for an external refresh address 
counter. When the CAS pulse is activated prior to the assertion of the RAS pulse, the DRAM 
automatically performs a refresh cycle on one row by employing an on-chip address counter. 
Upon completion of the refresh cycle, the address counter is automatically incremented. The 
MEM/R EF sig nal fro m the arbiter can be used by the DRAM timing and control logic block to 
initiate a CAS before RAS refresh cycle. 



Besides generating the RAS, CAS, and WE signals, the DRAM timing and control logic 
generates a number of warm-up cycles for the DRAM after reset by issuing several refresh 
requests. 
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Timing Considerations for the DRAIV! Controller 

Figure 4-9 shows a typical example of a timing diagram for a two-word read transaction that 
uses static column mode; similarly, Figure 4-10 is a typical example for a two- word write 
transaction. The example assumes a memory access time that requires two wait states (T^) for 
the initial data word and one wait for the second data word. 

The critical timing areas for both read and write transactions are noted by circled numbers in 
the diagrams, which are explained in the enumerated list below. 

1. The delay for the CPU to generate a valid address. 

2. The delay for the DRAM timing and control logic to generate the CYCLE-IN-PROGRESS 
signal. 

3. The delay to generate the DRAM row address. This time includes the address latch 
throughput delay, the multiplexer throughput delay, and the address driver delay. 



4. The delay to generate RAS, which includes the delay to generate the DRAM-CS signal. 



5. The row address hold time after the high-to-low transition of RAS. 

6. The time required to generate the multiplexer control signal (ROW/COL) after the row 
address hold time is satisfied. 

7. The time required to switch from a row to column address plus any driver delays. 

8. The delay to generate and drive the CAS signals. 

9. For a read transaction, the throughput delay of the data transceivers. For a write trans- 
action, the delay by the CPU to generate valid data. 

10. For a read transaction, the data setup time of the CPU. For a write transaction, the 
throughput delay of the data transceivers. 

1 1 . The time required to increment and drive the column address. 



12. For a write transaction only, the delay time to bring CAS high (terminate the CAS pulse 
for th e first data byte), to precharge the CAS pulse (required by the DRAM), and to assert 
CAS again. 



13. The RAS precharge time, which must be satisfied before another memory cycle can begin. 
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Figure 4-9: Timing Diagram for Two-word DRAM Read Transaction 
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Figure 4-10: Timing Diagram for Two-word DRAM Write Transaction 



DRAM Interleaving 

Because the DRAM consists of dynamic nodes, a row precharge time is required to recharge 
the nodes after every memory cycle. This time must be included in the timing evaluation, as 
noted by the example. To avoid the precharge time delay of the DRAM, the memory array can 
be arranged so that each subsequent memory access is most likely to be directed to a different 
bank. In this configuration, wait time between accesses is not required because while one bank 
of DRAMs performs the current access, another bank precharges and is ready to perform the 
next access iminediately. 

If DRAMs are interleaved (i.e., arranged in multiple banks so that adjacent addresses are in 
different banks), the DRAM precharge time can be masked for most accesses. With two banks 
of DRAMs, one for even 32-bit addresses and one for odd 32-bit addresses, all sequential 
32-bit accesses can be completed without waiting for the DRAM to precharge. 

Even when random accesses are made, two DRAM banks allow 50 percent of back-to-back 
accesses to be made without waiting for the DRAMs to precharge. The precharge time is also 
masked when the 80960KB processor has no bus accesses to be performed. During these idle 
bus cycles, the most recently accessed DRAM bank can precharge so that the next memory 
access to either bank can begin immediately. 
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SUMMARY 

The memory interface circuit allows the 80960KB processor to communicate with the memory 
devices. The basic memory interface logic can be divided into six blocks: the data 
transceivers, the address latches, the address decoder, the burst logic, the DRAM timing and 
control logic, and the byte enable latch. The DRAM controller and SRAM interface complete 
the memory interface circuit. The DRAM controller can be designed to take advantage of the 
80960KB processor's burst capability to enhance system performance. 

This chapter focuses on the design guidelines for the memory interface design to the 80960KB 
processor. Chapter 5 develops guidelines on designing peripheral devices in the single- 
processor hardware system. 
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CHAPTER 5 
I/O INTERFACE 



The 80960KB processor supports 8-bit, 16-bit, and 32-bit I/O devices that can be mapped into 
the 4G-byte memory address space. This chapter describes the design considerations for the 
interface between the 80960KB processor and I/O components. Several examples illustrate the 
design concepts. 

INTERFACING TO 8-BIT AND 16-BIT PERIPHERALS 

The 80960KB processor accesses I/O devices by using a memory-mapped address. Con- 
sequently, memory-type instructions can be used to perform input/output operations. For 
example, the 80960KB processor's LOAD and STORE instructions can be used to move 8-bit 
and 16-bit data to I/O peripherals. The instructions include those listed below. 

• Load Ordinal Byte (reads a byte) 

• Load Ordinal Short (reads 16-bit data) 

• Store Ordinal Byte (writes a byte) 

• Store Ordinal Short (writes 16-bit data) 

These instructions perform the transfer on the data bits specified by the two low-order lines of 
the effective address. See the 80960KB CPU Programmer's Reference Manual for complete 
details. 

GENERAL SYSTEM INTERFACE 

In a typical 80960KB processor system design, a number of slave I/O devices can be controlled 
through a general system interface. Other I/O devices, particularly those capable of controlling 
the L-bus, can use the general system interface, but may require additional logic to isolate the 
bus. This section describes the general system interface and assumes that the 80960KB 
processor does not perform burst transactions to the I/O devices. 

Figure 5-1 shows the major logic blocks of the general system interface. Standard 8-bit data 
transceivers add drive capability, provide bus isolation, and prevent bus conflicts that may 
occur with slow I/O components. The address latch demultiplexes the address/data lines and 
holds the address stable throughout the L-bus transaction. The address decoder generates the 
I/O chip- select signals from the latched address lines. The timing control block provides the 
READY signal to the 80960KB processor and the I/O read and I/O write command. 



5-1 



Intel 



I/O INTERFACE 



LAD31 -LADo 

DT/R 
DEN 



ALE 




80960KB 
PROCESSOR 



ADS 

READY 

W/R 



INTo 

INT1 

INT2/INTR 

INTa/INTA 

HOLD 

HLDA 



>^ 



y^ 



>^ 



CLK2. 



DATA 
D,R TRANSCEIVERS 



ADDRESS 
LATCHES 

Q 



ADDRESS 
DECODER 



TT 



-o| 



TIMING 
CONTROL 



DATA 



LOWER 
-^ ADDRESS LINES 
USED TO SELECT 
I/O REGISTERS 



I/O CHIP 
SELECT 
LINES 



-^^ l/ORD 



-^ l/OWR 



-INTERRUPT PINS 



Varbitration 

- y^PINS 



Figure 5-1 : Simplified General System Interface 
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This basic interface circuit is similar to the one used in the basic memory interface described in 
Chapter 4. For most systems the same data transceivers, address decoders, and address latches 
can be used to access both memory and I/O devices. The timing control logic can be designed 
to accommodate both memory and I/O devices. 

Data Transceivers 

Standard 8-bit transceivers can be used to provide isolation and additional drive capability for 
the L-bus. Transceivers prevent bus contention that can occur if some devices are slow to 
remove data from the data bus after a read cycle. For example, if an I/O write cycle follows a 
I/O read cycle, the 80960KB processor may drive the L-bus before a slow device has removed 
its outputs from the bus, potentially causing a current spike. Transceivers, however, can be 
omitted if the data float time of the device is short enough and the load does not exceed the 
80960KB device specifications. 

The data transceiver can be controlled by two signals from the 80960KB processor: Data 
Tra nsmit/ Receive (DT/R) and Data Enable (DEN). DT/^ indicates the direction of data flow 
and DEN enables the transceivers. 



Address Latch/Demultiplexer 

Standard transparent latches can be used to dem ultiplex the address/data lines of the 80960KB 
processor. The latch is controlled by the ALE signal from the 80960KB processor. The ALE 
signal passes through an inverter, s uch th at when ALE goes low, the address flows through the 
latch. The low-to-high transition of ALE can be used to latch the address. 

If only slave-type peripherals are used in a system, the output enable of the latches can always 
remain active by connecting it to ground. For systems with DMA devices, the output enable 
can be used to permit the DMA device to drive a common address bus. 

Address Decoder 

The address decoder determines which particular I/O device is selected by decoding the ad- 
dress. The I/O address can be any address in the 4G-word address range except for the upper 
16M bytes (addresses FFOOOOOOpj through FFFFFFFFj^), which the 80960KB processor 
reserves for inter-agent communication and internal I/O. Typically, a small range of address 
bits is reserved for accessing I/O devices by designating certain higher-order address bits to 
mean an I/O access. 

For example, consider a 32-bit address: A3J through Aj^ can indicate an I/O access when A3J 
is set to zero, and A3Q-AJ5 are set to one; Aj^ through A^ can be used to specify a particular 
I/O device; and A4 through A2 can be used to access up to 8 registers of the I/O component. 
A J and Aq are not used by the I/O device. This particular scheme selects up to 1,024 devices^ 
while using only 32K bytes of address space. 

The address decoder can be located either before or after the address latches. Usually, it is 
placed after the latches, so that the chip-select signal does not need to be latched. 
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Timing Control Logic 

The timing control logic accommo dates I/O devices that cannot transfer information at the 
maximum bus rate by generating a READY signal when the data is available. The timing 
control logic consists of a counter and timing logic, as shown in Figure 5-2. The counter 
produces a 4- bit bi nary coun t. The count is sta rted at t he beginning of the operation 
(det ermined by ADS and DEN) and is stopped by the R EADY signal. The timing logic asserts 
the READY signal, the I/O write command (I/O-W R), and the I/O read_command (I/O-RD) 
based upon the clock count, the I/O chip select signal (I/O-CS), and the W/R command. 
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Figure 5-2: Timing Control Block Diagram 



For many peripherals, the timing logic can be programmed to assert READY at the appropriate 
count for the selected device. Specific I /O chip s elect signals can be used to indicate how 
many clock cycles to wait before asserting READY. 



For some I/O peripherals, particularly bus masters, READY cannot be determined by counting 
clock cycles. For these I/O devices, READY can be supplied by the device and passed on to 
80960KB processor. 



The timing control block can assert the I/O-RD or I/O-WR signal for I/O devices based upon 
the clock count. The timing for these signals can be selected for the slowest device to simpUfy 
the logic circuit or can be customized for each individual peripheral device to maximize 
performance. 
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I/O INTERFACE DESIGN EXAMPLES 

The general system interface shown in Figure 5-1 can be used to connect the 8096dKB 
processor to many slave peripherals. The following list includes some common peripherals 
compatible with this interface: 

• 8259A Programmable interrupt Controller 

• 8253, 8254 Programmable Interval Timer 

• 8272 Floppy Disk Controller 

• 82062, 82064 Fixed Disk Controller 

• 82510, Asynchronous Serial Controller 

• 8274, 82530 Multi-Protocol Serial Controller 

• 8255 Programmable Peripheral Interface 

• 8041, 8042 Universal Peripheral Interface 

This section provides guidelines and design considerations for interfacing the 80960KB 
processor to different types of I/O configurations. Specifically, four design examples are 
examined. The 8259A and 82530 design examples show how to interface the 80960KB 
processor to a slave-type peripheral device. The 82586 design example shows how a 16-bit 
bus master reads and writes to the 80960KB processor's system memory. The 82786 design 
example shows how the 80960KB processor can read or write to graphics memory using a 
16-bit data bus. 

8259A Programmable Interrupt Controller 

The 8259A Programmable Interrupt Controller is designed for use in interrupt-driven 
microcomputer systems, where it manages up to eight independent interrupts. The 8259 A 
handles interrupt priority resolution and returns an 8-bit vector to the 80960KB processor 
during an interrupt-acknowledge cycle. Intel Application Note AP-59 contains detailed infor- 
mation on configurations of the 8259A. 

Interface 

Figure 5-3 shows the connection of the 80960KB processor to a single 8259A Interrupt Con- 
troller. This circuit consists of the general system interface plus a bidirectional buffer. The 
example assumes that several interrupt requests occur at the same time so that priority resolu- 
tion is needed. 
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Figure 5-3: Block Diagram for 8259A Interface 

The data lines from the 8259 A are not directly aligned to the 80960KB processor because of 
the difference in priority resolution between the devices. Although both devices use an 8-bit 
interrupt vector, the 80960KB processor implicitly defines the priority by dividing the interrupt 
vector by eight. The 8259A defines the priority in the lower three bits of the interrupt vector. 
Furthermore, the highest priority vector of the 80960KB processor has a value of 31 in the 
upper five bits of the interrupt vector. Whereas, the highest priority interrupt of the 8259A has 
a value of in the lower three bits of the interrupt vector. 

To resolve the priority difference, the interrupt vector from the 8259A can be inverted and 
rotated left by three bits as shown by the data alignment between the 80960KB processor and 
8259 A in Figure 5-3. Rotating the data bits in this manner provides two advantages: the 
interrupt table for the 8259A can be located by contiguous addresses, and the upper two most 
significant bits of the interrupt vector rem.ain free to group interrupt vectors if additional 
8259As are needed. 

Care must be exercised, however, when programming the registers of the 8259A. For ex- 
ample, assume that the second initialization command word (ICW2 register) of the 8259 A 
requires a data byte value of OOOlllllg. To transfer the correct information, the 80960KB 



5-6 



iny 



I/O INTERFACE 



processor needs to write a data word with the value of 000001 1 Ig because this word is rotated 
left three places and inverted. 

Operation 

The 8259A starts the interrupt cycle by generating an interrupt request (INT) to the 80960KB 
processor, which receives the signal at the INTR input pin. This assumes the Interrupt Control 
register of the 80960KB processor is set to accommodate an external interrupt controller. 



When the 80960 KB proc essor comes to a breakpoint in its execution, it asserts the INTA signal 
twice. The first INTA signal acknowledges the interrupt request and ca uses the 8259A to 
prioritize t he interrupt requests it received up to this point. The IN TA, together with the 
8259 A-CS, are applied to the timing control logic to generate a READY signal. 



The 80960KB processor automatically asserts the secon d INTA signal five clock cycles after 
the assertion of READY. After the second assertion of INTA, the 80960KB processor reads 
the interrupt vector from the 8259A. 

The bidirectional buffer inverts and passes the 8-bit vector to the 80960KB processor with the 
appropriate lines rearranged. The output enable signal for the data buffer is controlled by 
INTA for this op eration. After the data transfer is completed, the timing control circuit 
generates a second READY signal to terminate the interrupt acknowledge cycle. 

The same circuitry can be used to read or write to the 8259A registers. In this case, the 
80960KB processor selects the 8259A through a memory-mapped address. Local address line 
2 (A2) selects one of two internal registers of the 8259A. The I/O read or I/O write command 
is generated by the timing control circuit. The data passes through the bidirectional data buffer 
to or from the selected register of the 8259A. 

The direction of data flow through the buffer is controlled by thr ee logic gat es shown in Figure 
5-3. For an I/O write operation, the I/O write command and 8259 A-CS signal control the 
output enable signal of the bidirectional buffer. Similarly, for a read operation, the I/O read 
command and the 8259A-CS signal control the output e nable signal of the latch. After the data 
is transferred, the timing control circuit asserts READY. 

82530 Serial Communication Controller Example 

The 82530 Serial Communication Controller is a dual-channel, multi-protocol controller with 
on-chip baud rate generators, digital phase locked loops, various data encoding/decodings, and 
extensive diagnostic capabilities. The 82530 is designed to interface with high-speed serial 
communications lines using a variety of communication protocols, including asynchronous, 
synchronous, and HDLC/SDLC protocols. The 82530 contains two independent full-duplex 
channels. 

The general system interface circuit previously described can be used to connect the 80960KB 
processor to the 82530, as shown in Figure 5-4. The 82530 can send an interrupt request to the 
80960KB processor as shown or it can send the interrupt request to an interrupt controller, 
which in turn sends it to the 80960KB processor. The 80960KB processor responds to the 
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interrupt request and issues an a ddress. A fter the address is latched, the address lines are 
decoded to generate a chip-select (82530-CS) signal to activate the 82530. 
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Figure 5-4: Block Diagram for 82530 Interface 

The lower two address lines, A2 and A3, are used for channel selection and command/data 
selection. A2 is connected to the Channel- A/Channel-B (A/B) select input pin. This selects 
the channel that performs the serial read or write operations. A3 is connected to the 
Data/Command (D/C) select input pin. This signal defines the type of information transferred 
to or from the 82530 on the data lines (D^ through Dq). A high level means data is transferred; 
a low level indicates a command. 
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The timing control circuit generates an I/O read or I/O write command based upon the W/R 
command from the 8 0960KB processor. When the data transfer is completed, the timing 
control circuit sends a READY signal to terminate the transaction. 

The baud rate clocks can be programmed in several ways, including use of an external crystal. 

82586 Local Area Network Coprocessor Example 

The 82586 is an intelligent, high-performance communications controller designed to perform 
most tasks required for controlling access to a local area network (LAN), such as Ethernet or 
Starlan. In many applications, the 82586 is the communication manager for a station con- 
nected to a LAN controller. Such a station usually includes a host CPU, shared memory, a 
Serial Interface Unit, a transceiver, and LAN controller link, as shown in Figure 5-5. The 
82586 performs all functions associated with data transfer between the shared memory and the 
LAN link, including: 

Framing 

Link management 

Address filtering 

Error detection 

Data encoding 

Network management 

Direct memory access 

Buffer chaining 

High-level (user) command interpretation 
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Figure 5-5: LAN Station 

The 82586 has two interfaces: a 16-bit bus interface and a network interface to the Serial 
Interface Unit. The bus interface is described here. For detailed information on using the 
82586, refer to the Local Area Networking Component User's Manual. 

Interface 

There are several ways to design an interface between the 82586 and the 80960KB processor. 
The chosen design example shows how to interface the 82586 using a shared bus. In this 
example, the 82586 operates in minimum mode at one-half the processor clock frequency. 

The primary function of the interface circuit is to allow the 82586 to read and write 16-bit data 
using the 32-bit L-bus. This is accompHshed by adding the high-order address lines and 
translating the 16-bit data lines to the 32-bit data lines by using byte enable signals. 
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Figure 5-6 shows the 82586 interface circuit, which includes the DRAM controller (see the 
"DRAM Controller" section in Chapter 4 on page 4-11). This interface uses the general system 
interface circuit plus other logic units that specifically pertain to the 82586: the LAN data 
transceivers, the byte enable converter, and the LAN address latches. These logic blocks are 
highlighted by the shaded boxes. 
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Figure 5-6: Block Diagram for LAN Controller Interface 

The LAN data transceivers connect 16 data lines from the 82586 to both the upp er and lower 
16 bits of the L-bus. The data transfer is controlled by converting Aq, Aj, and the BHE to four 
byte enable signals as shown in Figure 5-7. Aj selects between the upper and lower 16-bit data 
lines; Aq selects the lowe r data byte for either the upper or lower 16-bit data lines; and the byte 
high enable signal (BHE) selects the upper data byte for either the upper or lower 16-bit data 
lines. Data flows through the buffers when the appropriate byte enable signal is asserted. The 
direction of the data flow is controlled by the DTfR signal of the 82586. 
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Figure 5-7: Byte Enable Generation Circuit 

The L AN address latches are used to demultiplex ADj^ through ADq. The address lines and 
BHE are latched by the ALE signal from the 82586. The upper address lines (A3J through 
Aj^) are generated by hardware programmable DIP switches. 

the 82586 begins operation when the Channel Attention (CA) input signal is asserted. This 
signal is generated by gating the write command and 82586 chip select signal. 



Operation 

The interaction between the 82586 and the 80960KB processor is described below and is 
sumrharized in Figure 5-8. 

• The 80960KB processor invokes the 82586 by supplying a me mory-mapp ed address and a 
write command* The memory-mapped address results in a 82586-CS signal, which is 
gated with a write command to produce the CA signal. 

• The 82586 responds by generating a hold request and waits for HLDA. 

• The 80960KB processor asserts HLDA, which enables the outputs of the LAN address 
latches and disables the outputs of the address latches next to the 80960KB processor. The 
HLDA signal also gives control of the L-bus to the 82586. 

• After the 82586 takes contr ol of the bus, it generates a 16-bit address (ADj^ through ADq), 
an ALE signal, and a BHE signal. The upper address lines are provided by the programm- 
able DIP switches to produce an address on the L-bus. 



A^ and Aq (from th e 825 86), and BHE are decoded to generate four byte enable signals 
(BE3 through BEq). DEN enables the output of the byte enable converter. 

DT/R from the 82586 controls the direction of the data flow through the buffers. 
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The read or write signal from the 82586 is applied to the DRAM controller. 
The 82586 accesses DRAM by using the DRAM controller. 



The DRAM-RDY is asserted by the DRAM controller. This action enables the output of 
the LAN data t ransceiver and terminates the 8258 6 memory cycle. The timing control 
logic passes the DRAM-RDY signal as the READY signal to the 82586. 

The 82586 deasserts HOLD and the 80960KB processor deasserts HLDA. The 80960KB 
processor regains control of bus. 
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82586 RETURNS CONTROL OF THE BUS TO THE 80950 



Figure 5-8: Operational Flow Diagram for 82586 Interface 



82786 Graphics Coprocessor Example 

The 82786 is a high performance graphics processor that provides high quahty text and ad- 
vanced display control. It provides full i^upport for graphics primitives at up to 25 million 
pixels per second and bit-mapped text up to 25 thousand characters per second. This graphics 



5-14 



iny 



I/O INTERFACE 



processor supports advanced features such as hardware windows, zooming, panning, and 
scrolling. Intel Application Note AP-259 and Application Note AP-270 contain detailed infor- 
mation on 82786. 

When using the 82786, it may be necessary for the 80960KB processor to write to graphics 
memory. The interface design example illustrates how the 80960KB processor can transfer a 
32-bit data word to the 16-bit data bus of the 82786. 



Interface 

There are several ways to design an interface between the 82786 and the 80960KB processor. 
In this example, the 80960KB processor reads or writes to graphics memory by accessing the 
82786 through the interface logic circuit. This example assumes that the 82786 operates in the 
slave mode, and that the 80960KB processor does not perform burst transfers. The 80960KB 
processor only performs burst transfers for instructions that specify accesses for more than one 
word or for instruction fetches. 

The interface circuit translates a 32-bit data bus to a 16-bit data bus by dividing the data lines 
into the upper and lower 16 bits and sequencing the data transmission. When the 80960KB 
processor writes to graphics memory, the bidirectional transceivers sequence the lower and the 
upper data bits of the L-bus to the 16-bit data bus of the 82786. 

The process is reversed when the 80960KB processor reads from graphics memory. The 
bidirectional transceivers form a 32-bit data word by latching the first 16-bit data word on the 
lower data lines, routing the next 16 bits to the upper data lines, and then passing the 32-bit 
data word on the L-bus. 

Figure 5-9 shows the details of the graphics controller interface circuit. This interface uses the 
general system interface circuit plus the following logic units: the bidirectional transceivers, 
the data buffer control, the data bus controller, and the address translator. These logic blocks 
are highlighted by the shaded boxes. 
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Figure 5-9: Block Diagram for 82786 Interface 

The bidirectional transceivers pass data to (from) a 32-bit data bus from (to) a 16-bit data bus. 
Data is sequenced through the transceivers by the control signals generated by the data buffer 
control logic. 

The data buffer control logic generates the signals that operate and sequence the bidirectional 
transceivers. The direction signal for data flow through the transceivers is derived from the 
W/R signal of the 80960KB processor. The data buffer control logic genera tes fo ur output 
enable signals: GABj^ enables the outputs on the B side for the lower 16 bits; GBA^ enables 
the outputs on the A side f or the lower 16 bits; GABj^ enables the outputs on the B side for the 
higher 16 bits; and GBAjj enables the outputs on the A side for the higher 16 bits. These 
output enable signals are derived from the byte enable signals and are asserted when the slave 
enable signal (SEN) is activated by the 82786, 

The select lines for the bidirectional transceivers allow data to flow from either the latched data 
or the input pins. These lines, which are not shown, can be hardwired. 

The d ata bus co ntroller provides the read and write commands, memory or I/O signal (M/IO), 
and a READYq signal. This circuit generates two read or write commands for every 32-bit 
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data transfer to or from the 80960KB processor (one for each 1 6-bit data transfer). The data 
bus controller starts counting clock cycles when the 82786-CS and CYCLE-IN-PROGRESS 
signals are asserted. At the proper time (b ased upon clock counts), it asserts the read/write 
comma nd. The d ata bus controller produces READYq after receiving the SEN signal from the 
82786. READYq resets the count, and another read/write command is generated. 

The addr ess tra nslator performs four functions: it c onverts t he four byte enable signals to Aq, 
Ap and BHE; it increments Aj after receiving READYq for the first 16-bit transfer; it 
generates the clock signal (CBA^) that latches the first 16-bit data word in the bidirectional 
transceiv ers when the 80960KB processor performs a read operation; and it generates the 
READY signal for the CPU. 

Not shown is the cycle detector circuit that generates the CYCLE-IN-PROGRESS signal. This 
signal can be generated by using the circ uit sim ilar to the one shown in Figure 5-2. The start of 
the cycle ca n be dete cted by gating the ADS and DEN signals. The end of the cycle can be 
indicated by READY. 

Operation 

The interaction between the 82786 and the 80960KB processor is summarized in Figure 5-10. 
The operation is divided into two 16-bit data movements for both a read and write operation. 
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Figure 5-10: Operational Flow Diagram for 82786 Interface Circuit 
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The 80960KB processor generates a memory-mapped address and data for the desired graphics 
memory location. It accesses the 82786 by triggering the interface circu it to generat e the chip 
select signal and several operational signals: the read (RD) or write (WR) command, BHE, and 
the memory or I/O (M/IO) signal. The 82786 begins the memory operation after it completes 
the current graphics processing activity. The 82786 acknowledges that it is performing a 
memory operation by asserting SEN. 

After the 82786 asserts SEN, it begins a 16-bit memory read or write operation by translating 
the address inputs (A2j through Aq) to a multiplexed DRAM address, and generating the 
DRAM control signals. Note that Aj and Aq are derived from the byte enable signals. 



For a read o peration, the data bus controller uses SEN to generate the READYq signal. The 
assertion of READYq causes the address translator to increment Aj and to generate CBA^, 
which latches the lower 16 data bits on the B inputs of the bidirectional transceivers to the A 
side. 



Similarly, for a write op eration, th e data bus controller uses SEN to generate the READYq 
signal. The assertion of READYq causes the address translator to increment Aj. The data 
buffer control uses SEN and the byte enable signals to produce GAB^, which enable the 
outputs for the lower 16 data bits of the bidirectional transceivers. 

The 82786 automatically deasserts SEN and the transfer of the first 16 data bits is complete. 
To transfer the seco nd 1 6 data b its, the interface circuit requests another memory operation by 
generating RD (or WR), BHE, and M/IO (CS is already asserted). After it completes the 
current graphics processing activity, the 82786 begins the memory operation and asserts SEN. 



For a read operation, the data bus contr oller u ses S EN to generate the READYq signal. The 
data buffer control uses SEN to assert GBAjj and GBA^, which enable the outputs for the 
higher and lower 16 data bits. 



For a write operation, the data bus controller uses SEN to generate the READYq signal. The 
data buffer control uses SEN and the byte enable signals to produce GABj^, which enable the 
outputs for the higher 16 data bits of the bidirectional transceivers. 



The add ress translator generates READY for the 80960KB processor from the second 
READYq to terminate the data transfer to the graphics memory. 

SUMMARY 

The 80960KB processor supports 8-bit, 16-bit, and 32-bit I/O interfaces. A general system 
interface circuit can be designed that connects to many slave-type peripherals. This interface 
can be expanded to accommodate a bus master peripheral or a 32-bit to 16-bit data bus 
translator. These interfaces were illustrated by four design examples. 



5-19 



Index 



INDEX 



82530/80960KB Interface 5-7 
82586/80960KB Interface 5-9 
8259A/80960KB Interface 5-5 
82786/80960KB Interface 5-14 

A 

Address Decoder 

I/O interface 5-3 

memory interface 4-2 
Address Latch/Demultiplexer 

I/O interface 5-3 

memory interface 4-2 
ADS (Address/Data Status) Signal 

definition 3-4 

timing diagram 3-9 

used by the 82786 interface 5-17 

used by the burst logic 4-3 

used by the timing control logic 5-4 
ALE (Address Latch Enable) Signal 

definition 3-4 

timing diagram 3-8 

used by an address latch/demultiplexer 
4-2, 5-3 

used by the byte enable latch 4-6 

used by the SRAM interface logic 4-8 

used in the SRAM interface logic 4-10 
Arbitration 

L-bus example 3-23 

protocol for the L-bus 3-17 

timing on the L-bus 3-19 

B 

BAD AC (Bad Access) Signal 3-37 
BE3-BEQ (Byte Enable) Signals 

definition 3-4 

timing 3-5 

timing diagram 3-9 

used by the byte enable latch 4-6 

used by the DRAM controller 4-16 



Burst Logic 

memory interface 4-3 
signal flow 4-4 

B urst Transaction 3-12 



CACHE Signal 3-6 

CLK2 (Processor Clock) or CLK (Bus 
Clock) 

CLK2 requirements 3-14 

generation 3-15 

relationship of CLK2 and CLK 3-8 
CYCLE-IN-PROGRESS Signal 

definition 4-3 

used by the burst logic 4-3 

used by the DRAM arbiter 4- 1 3 

used by the DRAM timing and control 
4-14,4-16 

used by the timing control logic 4-5 



Data Transceivers 

I/O interface 5-3 

memory interface 4- 1 
DEN (Data Enable) Signal 

definition 3-4 

timing diagram 3-9 

used by a data transceiver 4-1,5-3 

used by the burst logic 4-3 
DRAM Address Multiplexer 4-13 
DRAM Arbiter 4-13 
DRAM Controller 4-11 
DRAM Interleaving 4-19 
DRAM Refresh Interval Timer 4- 1 3 
DRAM Timing and Control 4-13 
DRAM Timing Considerations 4-17 
DT^ (Data Transmit/Receive) Signal 

definition 3-4 

timing diagram 3-9 



1-1 



■Iter 



INDEX 



used by a data transceiver 4-1,5-3 
used by the 82586 interface 5-11,5-12 



FAILURE Signal 3-37 

H 

HLDA (Hold Acknowledge) Signal 3-18 
HLDAR (Hold Acknowledge Request) Sig- 
nal 3-20 
HOLD Signal 3-18 
HOLDR (Hold Request) Signal 3-20 

I 

I/O Address Range 5-3 

I/O Interface to 8-bit and 16-bit Peripherals 

5-1 
I/O Interface to the 80960KB 5-1 
Initialization for 80960KB 3-34 
INTq/IAC (Interrupt^ or Inter- Agent 

Communication) Signal 3-29 
INTj (Interruptj) Signal 3-29 
INT2/INTR (Interrupt2 or Interrupt Request) 

Signal 3-29 
INT3/INTA (Interrupt3 or Interrupt 

Acknowledge) Signal 3-30 
Interrupts 

definition 3-29 
direct interrupt pins 3-31 
Interrupt Control register 3-30 
pins that interface to an interrupt con- 
troller 3-31 
signals 3-29 
synchronization 3-32 
timing diagram 3-31 

L 

L-Bus States 

address (T;) 3-21,3-1 
data(T^) 3-21,3-1 
hold(Tj^) 3-18 
hold request (\^) 3-21 



idle(T.) 3-21,3-1 

recovery (T^) 3-21,3-1 

wait(T^) 3-21,3-1 
LAD (Local Address/Data) lines 3-4, 3-3 
LADj-LADq 

See SIZE Signals 
LOCK Signal 

definition 3-6 

used during arbitration 3-21 

M 

Memory Address Range 4-3 
Memory Interface to the 80960KB 4-1 



PBM (Primary Bus Master) 3-20 



Read Operation, timing diagram for the L- 
bus 3-8 

READY Signal 
definition 3-5 
timing diagram 3-9 
used by the 82530 interface 5-9 
used by the 82586 interface 5- 1 3 
used by the 8259A interface 5-7 
used by the 82786 interface 5-17 
used by the byte enable latch 4-6 
used by the SRAM interface 4-8, 4-9, 

4-10,4-11 
used by the timing control logic 4-5, 
5-4 

RESET 

timing generation for 80960KB 3-33 
timing requirements for 80960KB 3-33 



SBM (Secondary Bus Master) 3-20 

SIZE Signals 
definition 3-3 

used by the 82586 interface 5-12 
used by the burst logic 4-3 



1-2 



iny 



INDEX 



SRAM Interface 

interface logic 4-6 
timing considerations 4-7 

T 

Timing 

arbitration on the L-bus 3-19 

interrupts 3-31 

read operation on the L-bus 3-8 

write operation on the L-bus 3-11 
Timing Logic 

I/O interface 5-4 

signal flow 4-5 

w 

W/R (Write/Read) Signal 
definition 3-4 
timing diagram 3-9 
used by the 82530 interface 5-9 
used by the 82786 interface 5-16 
used by the DRAM timing control logic 

4-14 
used by the timing control logic 5-4, 

4-5 
Write Operation, timing diagram on the L- 

bus 3-11 



1-3 



iny 



NAME: 

COMPANY: 
ADDRESS: . 



LITERATURE SALES FORM (EUROPE) 



PHONE NO.: 



ORDER NO. 










_ 



TITLE 



QTY. 



PRICE 



TOTAL 



X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 



Subtotal 

Your Local Sales Tax 

Postage 

Total 



PAYMENT 

Cheques should be made payable to your local Intel Sales Office. 

Other forms of payment may be available in your country. Please contact the Literature Coordinator at your 
local Intel Sales Office for details. 

The Completed form should be marked for the attention of the LITERATURE CO-ORDINATOR and returned 
to your local Intel Sales Office. 



UNITED STATES 
Intel Corporation 
3065 Bowers Avenue 
Santa Clara, CA 95051 

JAPAN 

Intel Japan K.K. 

5-6 Tokodai, Tsukuba-shi 

Ibaraki, 300-26 

FRANCE 

Intel Paris 

1 Rue Edison, BP 303 

78054 Saint-Quentin-en-Yvelines Cedex 

UNITED KINGDOM 

Intel Corporation (U.K.) Ltd. 

Pipers Way 

Swindon 

Watshire, England SN3 IRJ 

WEST GERMANY 
Intel Semiconductor GmbH 
Seidlstrasse 27 
D-8000Muenchen2 

HONGKONG 
Intel Semiconductor Ltd. 
1701-3 Connaught Centre 
1 Connaught Road 

CANADA 

Intel Semiconductor of Canada, Ltd. 
190 Attwell Drive, Suite 500 
Rexdale, Ontario M9W 6H8 



Printed in U.S.A 0388 15K RRD SM 
Embedded Controllers 



ISBN 1-55512-033-4 



